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ABSTRACT 

Numerical hydrodynamical simulations are used to predict the expected absorption 
properties of the Lya forest for a variety of Cold Dark Matter dominated cosmo- 
logical scenarios: CHDM, OCDM, ACDM, SCDM, and tCDM. Synthetic spectra are 
constructed duplicating the resolution, signal-to-noise ratio, and wavelength coverage 
of several published high resolution spectra, and their statistical properties compared 
on the basis of the flux distribution of the spectra, the distribution of coefficients in a 
wavelet decomposition of the spectra, and the distributions of absorption line profile 
parameters. Agreement between the measured and predicted cumulative distributions 
is found at the few to several percent level. The best-fitting models to the flux distri- 
bution correspond to normalizations on the scale of the cosmological Jeans length of 
1.3 < (Tj < 1.7 at z = 3. No single model provides a statistically acceptable match to 
all the distributions. Significantly larger median Doppler parameters are found in the 
measured spectra than predicted by all but the lowest normalization models (CHDM 
and tCDM), which provide poor fits to the flux distributions. The discrepancy in 
Doppler parameters is particularly large for absorption systems optically thin at the 
Lya line-centre. This may indicate a need to introduce additional energy injection 
throughout the intergalactic medium, as may be provided by late He ii reionization 
(zfjoii — 3.5) or supernovae-driven winds from young galaxies, and/or a larger baryon 
fraction than given by recent determinations of the deuterium abundance within the 
context of standard Big Bang Nucleosynthesis. The models require a hydrogen ioniza- 
tion rate at redshifts 1.7 < z < 3.8 within a factor of 2 of that predicted from QSOs 
alone as the sources of the UV photoionization background, although with a slower 
rate of decline with redshift at z > 3.5 than predicted from current QSO counts. 
Principal systematic uncertainties in comparing the models with the observations are 
the setting of the continuum level of the QSO spectra and the prevalence of metal 
absorption lines, particularly at z < 3. 

Key words: hydrodynamics - methods: numerical - intergalactic medium - quasars: 
absorption lines - cosmology: theory ~ large-scale structure of Universe 



1 INTRODUCTION 

The successful recovery of the observed statistical properties 
of the Lya forest by numerical simulations in the context 
of various cold dark matter (CDM) dominated cosmologies 
suggests that the gravitational instability scenario provides 
a broadly accurate description of the development and evo- 
lution of the structure of the intergalactic medium (IGM) 
(Cen et al. 1994; Zhang, Anninos, & Norman 1995; Hern- 
quist et al. 1996; Zhang et al. 1997; Bond & Wadsley 1997; 
Theuns, Leonard & Efstathiou 1998a). The simulations show 



that nearly the entire IGM fragments into filaments, sheets, 
and fluctuations in underdense minivoids, all of which give 
rise to the absorption lines comprising the Lya forest de- 
tected in the spectra of Quasi-Stellar Objects (QSOs) (Cen 
et al. 1994; Mhalda-Escude et al. 1996; Zhang et al. 1998). 
By analysing synthetic spectra drawn from the simulations, 
it has been demonstrated that the measured flux distribu- 
tions, neutral hydrogen column density distributions, and 
the evolution in the number of absorption lines per unit 
redshift are reasonably well accounted for by the complex 
web of interconnecting structures found in the simulations. 
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The structure of the Lya forest is intermediate in com- 
plexity between that of the Cosmic Microwave Background 
(CMB) and the large-scale distribution of galaxies. It may be 
reasonably hoped that the physics of gravitational instability 
and hydrodynamics alone, once a model for the rcionization 
of the IGM is assumed, is adequate for describing the mea- 
surements of the Lya forest. In this sense, the Lya forest 
represents a bridge between the relatively straightforward 
mechanisms believed responsible for the formation of the 
CMB fluctuations (given an input cosmological model and 
linear power spectrum) , and the complex physical processes 
involved in galaxy formation. For this reason, the Lya forest 
oS'ers a unique forum for testing models of structure format 
tion. 

The Keck HIRES (Hu et al. 1995; Lu et al. 1996; 
Kirkman & Tytler 1997) and, more recently, the VLT UVES 
(Cristiani & D'Odorico 2000) have provided high resolution 
(6- 9 kins^"' ), high signal-to-noisc ratio (50-100 per pixel) 
observations of the Lya forest that resolve the structure of 
the absorption features, and have dramatically improved the 
precision with which model predictions may be compared 
with the measured statistical properties of the Lya forest. 
The high precision of the data has elicited similar precision 
in the predictions from numerical simulations. Comparisons 
between the model predictions and the observations have 
boon effected by several groups by using the simulation re- 
sults to synthesize spectra that mimic the pixelization and 
typical noise characteristics of the measured spectra. Adopt- 
ing this approach, Dave et al. (1997) compare the predic- 
tions of their standard CDM (SCDM) simulation for the 
distributions of H i column densities and Doppler param- 
eters, as derived from Voigt profile absorption line fitting 
to synthesized spectra, with the distributions reported by 
Hu et al. (1995); Ranch et al. (1997) compare the pre- 
dictions of an SCDM model and a flat CDM model with a 
non- vanishing cosmological constant (ACDM ) for the distri- 
bution of pixel fluxes with distributions measured from sev- 
eral Keck HIRES QSO spectra; Theuns et al. (1998b, 1999) 
compare the predictions of an open CDM (OCDM), SCDM, 
and ACDM simulations for the distributions of absorption 
line parameters with those reported by Hu et al. (1995), Lu 
et al. (1996), and Kim et al. (1997); Weinberg et al. (1999) 
provide a preliminary low-resolution comparison between 
the predicted flux per pixel distributions of several cosmo- 
logical models with that measured in a sample of Keck 
HIRES spectra; and McDonald et al. (2000b) compare 
the prediction of the ACDM simulation of Miralda-Escude 
et al. (1996) for the distribution of pixel fluxes with the mea- 
sured distribution averaged over several HIRES QSO spec- 
tra. (In addition to mimicking the pixelization of the HIRES, 
Rauch et al. 1997, Theuns et al. 1998b and McDonald 
et al. 2000b also incorporate the effect of the spectral reso- 
lution of the HIRES in their synthetic spectra.) Because of 
the required small box sizes of the simulations, the synthetic 
spectra are much shorter than those measured, which cover 
a fairly broad range in redshift (typically Az « 0.5). This 
limitation has necessitated several approaches that degrade 
the precision of the comparison between the simulation pre- 
dictions and the data: assuming that an average distribu- 
tion measured over a range of redshifts is the same as the 
distribution at the average redshift (Dave et al. 1997; The- 
uns et al. 1998b, 1999; McDonald et al. 2000; Machacek 



et al. 2000); correcting the measured optical depths accord- 
ing to an assumed evolutionary rule (Raucli ct al. 1997, who 
assumed the pixel optical depths scale as t{z) oc (1 -|- z)^'^); 
or comparing the predicted and measured distributions over 
broad redshift bins (McDonald ct al. 2000). 

In addition to the tests above, there has been a par- 
allel effort to compare the predictions of the simulations 
for the flux power spectrum (Croft et al. 2000 and refer- 
ences therein; McDonald et al. 2000b) and auto- and cross- 
correlation functions (Zuo & Bond 1994; Miralda-Escude 
et al. 1996; McDonald et al. 2000b) with the observa- 
tions. These approaches target the measurement of large- 
scale clustering (low k) properties of the IGM and, implic- 
itly, of the underlying dark matter, more than the smaller 
scale (high A;) absorption properties of the Lya forest (al- 
though the two are related), as a means for constraining 
models of large-scale structure formation. In this paper we 
concentrate on the smaller scale properties of the IGM and 
make no further mention of these complementary statistics. 

The comparisons between the simulations and data on 
the basis of the flux and H i column density distributions 
have consistently produced very good agreement. An out- 
standing difficulty is the somewhat poorer agreement found 
for the distribution of Doppler parameters. The simula- 
tion convergence tests of Theuns et al. (1998b) and Bryan 
et al. (1999) show that as the resolution of the simulations 
increases, the absorption lines tend to narrow, although the 
trend need not be strictly monotonic (Meiksin & White 
2001). As a consequence, the SCDM, OCDM and ACDM 
simulations predict smaller Doppler parameters than mea- 
sured (at least when restricted to A'hi > 10^^ cm~'^ sys- 
tems), without the introduction of additional heating of 
the IGM (Theuns et al. 1999; Bryan & Machacek 2000; 
Machacek et al. 2000; Theuns, Schaye & Haehnelt 2000). 
The improved agreement found by Machacek et al. (2000) 
between the predictions of a tilted CDM (tCDM) model 
and the distribution of Doppler parameters reported by Kim 
et al. (1997), however, suggests that the case for disagree- 
ment may depend on the assumed cosmological model. 

While the concordance between the simulation predic- 
tions and measurements of the Lya forest has been good, 
statistical quantifications of the level of agreement have gen- 
erally been absent, rendering it difficult to assess the impor- 
tance of discrepancies, as in the Doppler parameter distri- 
butions. It also leaves open the more general question: are 
the predictions of the numerical simulations consistent with 
the observations at a statistically acceptable level? It is a 
central goal of this paper to address this question. Progress 
toward an answer has recently been made by McDonald 
et al. (2000b), who quantify the agreement between the pixel 
flux distribution for a ACDM simulation and the mean mea- 
sured distribution based on several HIRES spectra. To do so, 
however, they split the flux values into several (21) bins and 
use the statistic as a basis for comparison, finding agree- 
ment between the simulated and measured distributions for 
z > 3, but not at z = 2.41. Given the high resolution and 
wide wavelength range of the spectra, a far greater number 
of degrees-of-freedom are available, permitting more strin- 
gent tests. For a typical pixel number of A?" ~ lO**, it should 
be possible to measure the cumulative fiux distribution to 
a precision of l/AT^/^ ss 0.01, or perhaps a few times this 
allowing for correlations in the flux values of neighbouring 
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pixels. No group has fully exploited the information available 
from Voigt profile fitting, which typically produces several 
hundred lines per spectrum. 

A principal theme of this paper is to determine how 
closely the simulation predictions match the statistics of the 
Lya forest, as extracted directly from the measured spec- 
tra, and to determine the limitations involved, both obser- 
vational and theoretical. To do so, we use a suite of sim- 
ulations for a variety of cosmological models to construct 
synthetic spectra designed to match the resolution, pixeliza- 
tion, wavelength coverage, and noise properties of a set of 
measured spectra published in the literature. Both the syn- 
thesized and measured spectra are analysed using identical 
software to minimize any differences due to the arbitrary na- 
ture of the algorithms. In this way, we are able to determine 
how sensitive the statistics of the absorption features are to 
the properties of the spectra, including data-reduction sys- 
tematics, like the uncertainty in the continuum-fitting. We 
employ several statistical tools: the distribution of flux per 
pixel, the distributions of H i column density and Doppler 
parameters, and a novel approach based on the Discrete 
Wavelet Transform (DWT) of the data (Meiksin 2000). 

This paper differs from previous work in several im- 
portant regards: 1. We simulate the full wavelength cov- 
erage of the measured spectra (between Lya and Ly/3 ), 
to the wavelength precision of the spectra, incorporating 
the specific noise characteristics of each spectrum. 2. We 
consider a broader range of statistical tests, and employ 
identical software to analyse both the simulations and the 
measured spectra. 3. We introduce statistical quantifications 
of the differences between the model predictions and the 
observations, including analyses of the probability distri- 
butions of the tests employed. 4. We consider a broader 
range of cosmological models than has been done previ- 
ously in the context of the full set of tests considered. 5. 
Lastly, our fluid computational method differs fundamen- 
tally from those employed in most of the other simulations. 
While most use Smoothed Particle Hydrodynamics (SPH) 
with artificial viscosity to solve for the hydrodynamics part 
of the simulations, we use an Eulerian (in the co-moving 
frame) finite-difference shock-capturing scheme employing 
the Piecewise Parabolic Method (PPM). The exception is 
the ACDM model of Miralda-Escude et al. (1996), who also 
used an Eulerian shock-capturing scheme, based on the to- 
tal variation diminishing scheme. We believe it important 
to consider simulations using different numerical schemes, 
particularly when making high precision comparisons with 
the data. 

Our ultimate goal, toward which this paper is directed, 
is to determine whether or not the data and the simulations 
are of adequate accuracy to distinguish between differing 
cosmological models and reionization scenarios. If so, the 
simulations should provide a valuable independent tool for 
testing models of structure formation as well as probing the 
physical state of the gas and the energetic processes that ac- 
companied the formation of early sources of photoionization, 
whether QSOs, galaxies, or smaller stellar systems. 

The paper is organized as follows. In the next section 
we describe the methods used to compare the simulation re- 
sults with the measured spectra. In §111, the analysis results 
for seven QSO spectra are presented. A discussion of the re- 
sults follows in §IV. Our conclusions are summarised in §V. 



Various tests of the analysis procedures and absorption line 
fitting algorithms are presented in the two appendices. 



2 ANALYSIS METHODS 
2.1 Models and spectra 

The models used in the paper are described in detail in 
Machacek et al. (2000) (MOO) and summarised in Table |l[ 
All the models considered are in the context of CDM dom- 
inated cosmologies. The following six models were simu- 
lated using Kronos (Bryan et al. 1995), a combined Par- 
ticle Mesh (PM) -|- PPM gravity-hydrodynamics code: a 
standard critical-density flat CDM model (SCDM), two flat 
CDM models with a non-vanishing cosmological constant 
and low (ACDMl) and high (ACDMh) baryon densities, 
an open CDM model (OCDM), a tilted flat CDM model 
(tCDM), and a flat critical-density mixed dark matter model 
with a hot component added to the CDM (CHDM). Ah 
the power spectra have been normalized to be consistent 
with the abundances of rich clusters (White, Efstathiou & 
Frenk 1993; Bond & Myers 1996; Eke, Coles & Frenk 1996; 
Pierpaoli, Scott & White 2000), as the scales involved are 
near the scales relevant to the formation of the structures 
giving rise to the Lya forest. In addition, the tilt adopted 
in the tCDM model was chosen also to match the normal- 
ization on large scales (although only marginally) as deter- 
mined from the COBE measurements of the CMB (Bunn 
& White 1997). The ACDMh model was designed to yield 
larger Doppler parameters than the ACDMl model in an at- 
tempt to better match the measured values. Accordingly, in 
addition to increasing the baryon density, we have boosted 
the rate of He ii heating by QSO sources by a factor of 1.8 
to mimic the sudden onset of He ii photoionization at late 
times (Bryan & Machacek 2000). There is some uncertainty 
as to the best value of the baryon density flbh^ to adopt, as 
there is some spread depending on the method by which it 
is measured. Measurements based on deuterium abundances 
within the context of standard Big Bang Nucleosynthesis 
(BBN) (Buries & Tytler 1998; O'Meara et al. 2000) yield 
lower values than the best estimates based on measurements 
of the acoustic peaks in the CMB (Netterfield et al. 2001; 
Pryke et al. 2001; Stompor et al. 2001). The values adopted 
for all the models are consistent with the limits of Copi, 
Schramm & Turner (1995) based on a variety of observa- 
tional constraints. The value used for the ACDMl, OCDM 
and SCDM models is at the 3a lower limit of Buries & Tytler 
(1998) and O'Meara et al. (2000) based on measurements of 
the deuterium abundances in intergalactic gas clouds, and 
assuming no destruction of primordial deuterium. (Absorp- 
tion systems with low metallicity were specifically chosen 
by these groups to ensure the effects of stellar processing 
were minimal, but a small amount of processing would in 
principle permit a lower value for Qbh^ ■) The value for the 
ACDMh model agrees with the best estimate of O'Meara 
et al. (2000) and Pryke et al. (2001). The value used for the 
CHDM and tCDM models is midway between the best esti- 
mates of O'Meara et al. (2000) and Stompor et al. (2001). 

The initial data for all models except the CHDM 
and ACDMh models were generated using COSMICS 
(Bertschinger 1995) with the BBKS transfer function 
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(Bardeen et al. 1986) to compute the starting redshifts 
and the unconstrained initial particle positions and velocity 
perturbations appropriate for each model. For the CHDM 
model we used CMBFAST (Seljak & Zaldarriaga 1996) to 
generate the initial conditions. For the ACDMh model the 
power spectrum was based on the analytic fits of Eisenstein 
& Hu (1999). 

We use the synthetic spectra described in MOO. The 
spectra are resampled according to the pixelizations given 
in Table ^ and the spectral resolution mimicked by smooth- 
ing the spectra with a gaussian of the indicated widths. 
Noise was added according to the varying signal-to-noise 
ratio across each measured spectrum, as described below. 

In order to construct spectra with the identical wave- 
length coverage as the observed spectra, the simulation re- 
sults from a broad range in redshifts must be used. Since the 
simulation results are saved at only integral redshift values 
between z = 1 and z = 5 (except for the ACDMh model, 
for which the data are saved more frequently), it is neces- 
sary to interpolate between redshifts. (We do not use the 
2 = 1 results since the fundamental mode across the simu- 
lation volume has grown too large by this time to be certain 
of the reliability of the results.) To assist in the interpo- 
lation, we take advantage of the redshift scaling relations 
of the gas properties found by MOO. Because the gas is in 
photoionisation equilibrium, the optical depth at any given 
wavelength scales like t(A) oc aA{T) /\r{z)H{z)], where 
n is the gas density, aA{T) is the (Case A) radiative re- 
combination coefficient at gas temperature T, H{z) is the 
Hubble constant at redshift z, and r{z) is the incident rate 
of ionising photons per neutral atom. Since the morpholo- 
gies and overdensities of the overdense structures evolve only 
slowly in the comoving frame and T scales like the square- 
root of the gas overdensity (Zhang et al. 1998), and since 
aA{T) (X (approximately), we may expect the combi- 

nation 7i^aA(r) to scale like a power of the mean gas density 
n{z), which itself scales as (1 -I- z)^. Accordingly, we scale 
the optical depths according to t(z) oc (1 + z)'^ /[r{z)H{z)]. 
The value a = 6 is expected for a homogeneously expand- 
ing medium of fixed temperature. We find that in general a 
is model-dependent. For any particular cosmological model, 
we determine a between two redshifts zi and Z2 by finding 
the value which best predicts the mean optical depth, de- 
fined by fa = — log(exp(— r)) averaged over the spectra, at 
z — Z2 using the spectra ai z = zi. (In principle, a different 
value for a may be found by reversing the roles of zi and Z2. 
We checked this and found in all cases the same value for a 
was derived.) The values found are provided in Tables ^ and 
^ We allow for an overall rescaling of the radiation field by 
multiplying the optical depths by a scale factor s. (The fac- 
tor s could also be interpreted as a rescaling of the baryon 
density, although this would entail a change in the gas tem- 
perature, which we do not allow for.) The UV background 
radiation field adopted in the simulations evolves according 
to the model of Haardt & Madau (1996) predicted for QSOs 
as the sources of the radiation, and assuming an intrinsic 
QSO spectral index ag = 1.5. The radiation field is turned 
on at 6 < Zion < 7. We additionally accommodate an al- 
ternative evolution in the radiation field by allowing for a 
further evolution factor (1 -I- zY multiplying the Haardt & 
Madau rate TnM(z). Thus we predict r at any given wave- 



length A at redshift z based on the value at redshift zi at 
wavelength A(l + z)/(l + zi) according to 



\^ r-iiM{zi)H{zi) 



i(z)H{2 



(1) 



where Zj-^t is a reference redshift introduced for p 7^ 0. The 
values for s, p, and z^cf adopted for different cosmologies 
and required to match the measured spectra are provided 
below. 

The interpolation to intermediate redshifts to cover the 
wavelength range of a particular measured spectrum is per- 
formed by generating spectral segments from the simula- 
tions, each a fixed redshift interval Az = 0.1 in length, scal- 
ing the optical depth in each segment according to equa- 
tion 0, and then effectively piecing the segments together to 
match the wavelength range of the measured spectrum. In 
order to interpolate between two data dumps at redshifts zi 
and Z2, we compute the distribution functions f{x; z, Zd) for 
any given statistic x within a redshift interval (z, z ~\- Az) by 
first extrapolating the optical depths from a data dump at 
Zd to z, and then computing the final distribution function 
for a redshift interval zt < z < Za according to 



f{x) = ^ Az [w2,if{x; Zi, zi) + Wi,if{x; Zi, Z2) 



(2) 



using the linear weights Wj^i = \ {zi ~ Zj)/{z2 — 21) |, and in- 
crementing Zi from zi, to Za ■ We find this procedure recovers 
the distribution functions at intermediate redshifts to very 
high accuracy, as is shown in Appendix 

In all cases, the wavelength coverage z^ < z < Za 
corresponds to a range between Lya and Ly/3 in the 
QSO restframe to avoid confusing Ly/3 absorption with 
Lya absorption. In practice the ranges are usually shortened 
to a smaller interval, depending on the wavelength coverage 
of the spectrum and to avoid the local photoionizing influ- 
ence of the QSO (the proximity effect). 

A major uncertainty in comparing with the measured 
spectra is the setting of the continuum level. Normally peak 
flux points are used to determine the level of the continuum 
in the Lya forest region of the spectrum. In regions where 
the fiux never recovers to the true continuum, however, this 
procedure will result in an underestimate of the true con- 
tinuum level. Without an automated procedure for inferring 
the continuum within the forest, it is difficult to gauge the 
effect of the uncertainty in the continuum level. We find that 
making no correction to the continuum frequently results in 
a mismatch in the flux distribution for flux values near unity 
(in units of the continuum). We model errors in the contin- 
uum level by allowing for a shift in its overall value according 
to: 



(0) 



1 



(3) 



where .F"^'"' is the flux predicted by the numerical model at 
pixel i, and Fi is a corrected flux allowing for a downward 
shift in the continuum level by the fractional amount lo, 
applied uniformly across the spectrum. This is a crude ap- 
proximation to the actual errors likely to occur. We adopt 
it for its simplicity in order to assess the impact of possible 
continuum errors. For a more accurate comparison, it would 
be necessary to re-analyse the raw data and offset the simu- 
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lation spectra at each pixel following a procedure consistent 
with that of the observers. 

The fitting of absorption lines to the spectra requires 
a noise value for each pixel. We generate a noise matrix 
from the la noise levels in the observed spectra, averaging 
the noise in several flux bins over redshift intervals of width 
Az = 0.1. We then draw a noise value assuming the noise 
to be gaussian distributed with a standard deviation cor- 
responding to the matrix element of the redshift and flux 
of the pixel. This allows us to model a varying signal-to- 
noise ratio across a spectrum, and allows for a correlation 
between fiux and noise. We have verified that the statistical 
properties of the spectra are not very sensitive to the noise 
model; averaging over redshift bins but not allowing for a 
correlation between fiux and noise level yields very similar 
results. 

2.2 Statistical analysis tools 

There are several statistics that may be used as a basis of 
comparison between the models of the Lya forest and the 
observed spectra. The most fundamental is simply the distri- 
bution of flux per pixel. A high level of degree of accuracy in 
the cumulative fiux distribution may be achieved because of 
the large number of pixels in the spectra. The measured and 
predicted cumulative distributions may be compared using 
the Kolmogorov-Smirnov (KS) test, for which the expected 
value of the difference (in absolute value) (Iks between the 
distributions for A'' measurements is Jks — and 
(Iks should exceed only 1 per cent of the time. For 

a typical number of pixels N ~ 10*, the agreement between 
the predicted and measured cumulative fiux distributions 
should then be dKS ~ 0.01. The probability distribution is 
based on the assumption that the measurements are sta- 
tistically independent. Because the absorption features will 
introduce correlations between neighbouring pixel fiuxes (as 
will the resolution of the spectrographs), this assumption is 
formally violated. As is shown in Appendix the correla- 
tions effectively tend to reduce the number of independent 
pixels by a factor of 3-5. Even allowing for this, the test 
still provides a powerful means of quantifying the statistical 
agreement between the model predictions and the measured 
spectra. 

Additional constraints are provided by the fiuctuations 
in the spectra, which are related both to the underlying 
baryon density fiuctuations and to the thermal and veloc- 
ity widths of the associated absorption lines. We quantify 
the fiuctuations in two ways. The scales of fiuctuation of the 
spectral features may be directly characterised using the Dis- 
crete Wavelet Transform, which quantifies the changes in a 
given spectrum on being smoothed from one velocity reso- 
lution scale to another (Meiksin 2000). A multiscale analy- 
sis based on the Daubechies wavelets effectively performs a 
smoothing of the spectrum over successive doublings of the 
pixel width, weighted such that the spectrum is decomposed 
into orthogonal components corresponding to different ve- 
locity scales. (We use the Daubechies wavelet of order 20.) A 
wavelet analysis offers several advantages over the more tra- 
ditional Voigt-profile absorption line analysis: 1. the method 
is objective, in that it does not require making arbitrary de- 
cisions regarding the deblending of absorption lines or the 
acceptability of a fit; 2. the method produces a large number 



of wavelet coefficients, at the resolution level L for A'^ 

total pixels, corresponding to a typical (Iks ~ 0.01 x 2^*^^, 
thus enabling stringent statistical constraints to be set; 3. 
the coefficient distributions are insensitive to noise (Meiksin 
2000); 4. the analysis is extremely fast. Monte Carlo sim- 
ulations show that the wavelet coefficients are only weakly 
correlated, so that they may be treated as statistically in- 
dependent (Meiksin 2000). There are, however, some dis- 
advantages as well: 1. the light fiuctuations are quantified 
only over successive doublings of the pixel size, allowing 
only coarse velocity resolution in the statistical quantifica- 
tion of the fluctuations; 2. the method is indifferent to the 
origin of the fluctuations, and in particular does not distin- 
guish directly between the effects of H i column density and 
Doppler broadening, which may especially become mixed in 
the presence of substantial line-blending. For these reasons, 
we perform Voigt proflle analyses as well. 

We utilise AutoVP (Dave et al. 1997) to decompose the 
spectra into a set of Voigt absorption line proflles. Typically 
300-500 lines are found per spectrum, providing a preci- 
sion of dKS ~ 0.05 in the cumulative distributions of A^'hi 
and b. In order to determine the sensitivity of the resulting 
distributions of absorption line parameters to the line anal- 
ysis method, we have also developed a separate algorithm 
called SPECFIT. Rather than searching for absorption max- 
ima, and adjusting the number of lines required for a good 
fit, as does AutoVP, SPECFIT first splits the spectra into seg- 
ments bordered by fiux values above that corresponding to 
a minimum optical depth threshold Tmin, and then searches 
for infiection points in noise-filtered representations of the 
data in each segment in order to identify the locations of 
candidate absorption lines. The candidate lines are then fit 
to the original spectrum. While less sophisticated than Au- 
toVP, SPECFIT operates a factor of 50-100 faster with a 
comparable level of success in the fits, except that it some- 
times misses the weakest optically thin features (r < Tmin). 
The two methods are compared in Appendix 

In the following, we shall generally regard the 3a level 
(a KS test probability of Pks = 0.001) as the threshold for 
rejecting a predicted distribution compared with the mea- 
sured using the KS test. 



2.3 Data 

We compare the simulation results with six Keck HIRES 
spectra and one VLT UVES spectrum that have been pub- 
lished in the literature. The quasars, the redshift range used 
in the comparison, the pixel resolutions of the spectra, and 
the spectral resolutions, are summarised in Table |2| All the 
spectra were normalized to a unit contirmum level by the 
observers. 



3 RESULTS 

In this section, we present comparisons between the simula- 
tions and the data as quantified by the fiux per pixel distri- 
butions and the distributions of wavelet coefficients and ab- 
sorption line parameters. Because of differences in the char- 
acteristics of the spectra and the dispersion in the statistics 
extracted from the measured spectra, we present the results 
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for each spectrum separately. The reader interested primar- 
ily in the implications of the results may choose to skip this 
section and proceed to Section |^. 

The amplitudes of the distributions found in the mod- 
els cannot be used as a basis of comparison since the optical 
depths may be arbitrarily rescaled for any individual model 
by the ionization bias factor foion oc 0.1 (Hui & Gnedin 
1997; Croft et al. 1997). (We note the temperature-density 
relation found by Zhang et al. 1998 for systems in the col- 
umn density range 12.5 < logj^Q A*'hi < 14.5 is consistent 
with this dependence.) The dependence on Q.b is based on an 
approximate polytropic relation between density and tem- 
perature found to hold at low density (Meiksin 1994; Hui 
& Gnedin 1997), corresponding to the low and moderate 
column density systems which dominate the absorption in 
the spectra. The relationship, however, is not valid for higher 
column density systems (logjQ A''hi > 15) because these arise 
predominantly in high density regions (Zhang et al. 1998), 
for which the relationship inverts (Meiksin 1994; Theuns 
et al. 1998b). The scaling also neglects the effects of changes 
in the gas temperature on absorption profiles that will re- 
sult from a change in the baryon density, or of any associated 
pressure changes. For these reasons, it is most conservative 
to view a change in &ion as a change in F, which has little 
effect on the gas temperature or pressure. It is important to 
normalize all the models consistently with any given mea- 
sured QSO spectrum before comparing the shapes of any of 
the distributions with those measured. This may be accom- 
plished in a variety of ways. We do so by matching the mean 
H I optical depth fa in each simulation to the measured in- 
tergalactic H i optical depth over the comparison redshift 
interval. Since the resulting model flux distributions will of- 
ten mismatch the observed distributions for flux values near 
unity, we will also allow for a constant ofi'set in the con- 
tinuum as given by equation ^ while keeping the value of 
fa fixed at the observed value. In some instances it is also 
found necessary to introduce additional evolution in the UV 
radiation background to that adopted in the simulations. 



3.1 Q1937 1009 

3.1.1 Flux distribution 

We investigate Q1937-1009 (Buries & Tytler 1997) most 
thoroughly in order to assess the magnitude of some of the 
possible systematics in our analyses. Over the redshift range 
3.126 < z < 3.726, the mean optical depth in the measured 
spectrum is fa = 0.44. The mean optical depths in red- 
shift intervals Az = 0.1 wide are shown in Fig. |l|, along 
with estimates from each of the models. The optical depth 
per pixel has been rescaled according to equation Jy for each 
model, using the parameter values given in Table We note 
that the redshift dependences for the optical depth per pixel 
found in the simulations are generally steeper than the de- 
pendence expected in a homogeneously expanding universe: 
t{z) oc (1 + z)*^ /H{z) (assuming constant F). By contrast, 
the measured values of the mean optical depth fa are nearly 
flat with redshift. Formally they are well fit by fa = ail+z)^ , 



with a = 0.022 



-1-0.044 



and b = 2.0 ±0.74, where the standard 



deviation of the measured values of fa is adopted as a mea- 
surement error for each value. The models predict a near 
doubling of the mean optical depth over the same redshift 



0.6 



o 



0.4 



0.2 



)1937-1009 




4.2 4.4 4.6 

1 + z 



Figure 1. Redshift dependence of mean optical depth fa for 
Q1937— 1009 (horizontal bars). The open symbols show the val- 
ues for fa found in the simulations based on the evolution of the 
UV radiation background as determined from QSO sources by 
Haardt & Madau (1996). The filled symbols show the values for 
fa after allowing for an alternative redshift dependence of the UV 
radiation background. The symbols are: CHDM (circles), ACDMl 
(squares), ACDMh (diamonds), OCDM (triangles), SCDM (in- 
verted triangles), tCDM (pentagons). 



interval, corresponding to 6 = 6, significantly steeper than 
measured. It is evident that an alternative evolution rate 
to that of the simulations is required to match the redshift 
dependence of the mean optical depths. To match the ob- 
served evolution, we adopt the additional rate exponents p 
for Zrcf = 3 (cf equation |l|) as given in Table ^ The result- 
ing distributions of fa for the various models are shown in 
Fig. 0. 

The distributions of fiux per pixel for the various models 
are compared with the observed distribution in Fig. |^. (Flux 
values exceeding unity result from noise fiuctuations.) The 
agreement is generally good, particularly for the ACDMl 
model. The models, however, all show some disagreement 
at flux values near unity, and for this reason they are all 
formally rejected by the KS test. For the ACDMl model, 
the maximum difference (in absolute value) between the 
predicted and measured cumulative distributions is d^s ~ 
0.022, corresponding to a formal acceptance probability of 
Pks = 10"*. The next best model is SCDM, which gives 
dKS = 0.039 and Pks = 8 x 10"". 

Allowing for the possibility that the continuum level 
in the observed spectrum was set slightly low, resulting in 
overestimates of the normalized flux, we increase the fiux per 
pixel according to equation ^ choosing values for s and uj 
that simultaneously preserve fa = 0.44 and match the mea- 
sured flux distribution at flux values near unity. The result- 
ing flux distributions are shown in Fig. |^, using the values 
for s and uj in Table |^. The agreement in the cumulative dis- 
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Figure 2. Comparison between the measured flux distribution of 
Q1937— 1009 and the model predictions. Particularly good agree- 
ment is found for the ACDMl model, although it is formally 
rejected by the KS test. (The measured distribution has been 
rebinned in the upper panel for clarity.) 
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Figure 3. Comparison between the measured flux distribution 
of Q1937— 1009 and the model predictions. A continuum offset is 
applied to each model to enforce matching the cumulative distri- 
butions at flux values near unity. The distributions tighten com- 
pared with Figure jij The SCDM model is now preferred over 
ACDMl • (The measured distribution has been rebinned in the 
upper panel for clarity.) 



tributions is now tightened, with the SCDM model agreeing 
best with the observed distribution. For the SCDM model, 
dKS = 0.016, corresponding to a formal acceptance probabil- 
ity by the KS test of Pks = 0.011. By comparison, the next 
best models are ACDMh and ACDMl, with dKS = 0.031 
and 0.033, respectively, both of which are strongly rejected 
by the KS test, with respective acceptance probabilities of 
Pks = 4 X 10"^ and 5 x 10"^". 

We recall that the KS test is based on the probability 
distribution of N^''^dKS, where A'' is the number of inde- 
pendent measurements in the sample. In Appendix it 
is shown that the probability distribution for this quantity 
based on the simulated spectra is broader than the theoret- 
ical distribution, as may be expected if the flux values in 
neighbouring pixels are correlated. Because of the limited 
number of lines-of-sight drawn from the simulation volume, 
it is not possible to attach meaningful probabilities to large 
values of dKS directly from the simulations. It is shown in 
Appendix however, that the probability distribution is 
close to that of the KS test for an effective number of pixels 
reduced by a factor of 3-5 from the actual number. Allow- 
ing for this reduction, the KS probabilities for the SCDM, 
ACDMh , and ACDMl models (with continuum offsets ap- 
plied) become, respectively, Pks = 0.36 - 0.68, 0.002-0.037, 
and 0.001-0.024, stiU favouring SCDM, but not excluding 
the ACDM models. The predictions of the CHDM, OCDM 
and tCDM models, however, are still strongly rejected. We 
note that not allowing for the continuum offset correction 
for ACDMl gives the KS probability, allowing for the re- 
duced effective number of pixels, Pks = 0.081 — 0.29, so 
that the ACDMl model is perhaps no less viable than the 
SCDM model. The difference in the probability value with 
and without a continuum offset illustrates how susceptible 
Pks is to even a small degree of uncertainty in the continuum 
level. We note that while each of the ACDMl models dis- 
agrees with the measured flux distribution more than does 
the SCDM model, the predicted flux distributions bracket 
the measured distribution, so that an intermediate ACDM 
model may be expected to provide an even better match. 



3.1.2 Wavelet coejficient distributions 

The distribution of wavelet coefflcients is shown in Fig. ^ 
both with no continuum offset applied and with the off- 
sets in Table ^ The models generally show poor agreement 
with the measured distribution at the smallest velocity scale, 
^ 4 — 8kms~^ . It is possible that this is due to an inad- 
equancy in the modelling of the spectral characteristics of 
the Keck HIRES in the synthetic spectra. Better agreement 
with Keck data, however, was achieved by Meiksin (2000) 
using similar modelling for a set of Monte Carlo realisations. 
It may be that the disagreement results from limitations of 
the simulations. We discuss this further in Section ^ below. 
A discrepancy is also found on the next scale of 8-17 km s~^ , 
with haff the models (ACDMl , OCDM and SCDM) exhibit- 
ing too great an amount of velocity structure (as indicated 
by the larger values of the coefficients) . The discrepancy be- 
tween these models and the data suggests a greater number 
of absorption features predicted by the simulations on these 
scales than in the data. The lower normalization models, by 
contrast, predict a smaller amount of structure than in the 
data, while the ACDMh model is intermediate and agrees 
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Figure 4. Comparison between the measured wavelet coeffi- 
cient cumulative distributions of Q1937— 1009 (solid curves) and 
the model predictions with (dashed curves) and without (dotted 
curves) continuum offset corrections. The curves from left to right 
correspond to the approximate velocity scales 4—8, 8-17, 17—34, 
34-68, and 68-136 km s"! . The predictions of the CHDM model 
agree best with the measured distributions. 



Figure 5. Comparison between the measured absorption line pa- 
rameter distributions of Q1937— 1009 and the model predictions 
using AutoVP, allowing for an offset in the continuum level. 
While good agreement is generally found for the A^hi distri- 
butions, the predicted b distributions are generally skewed 
toward lower 6- values than measured. (The measured distri- 
butions have been rebinned for clarity of presentation.) 



best with the data (though it is formally rejected by the 
KS test, with cLks = 0.05 and Pks = 6 x lO"''). The dis- 
agreement at the next velocity scale of 17-34 km s~^ is gen- 
erally much worse. These velocities correspond to the typical 
Doppler parameters of the absorption features (see below). 
Only the CHDM model provides an acceptable match to the 
data on these scales; the other models predict far too much 
structure. The effect of the continuum offset correction is 
generally negligible on all velocity scales, except for CHDM 
and tCDM (which have the largest corrections), although it 
is small even for these. For CHDM, the KS test gives on the 
scale 17-34 km s"^ , dKS = 0.041 and Pks = 0.030 without 
the offset correction, and (Iks ~ 0.035 and Pks = 0.083 with 
the correction. The agreement between the models and the 
data tends to improve for all the models at larger velocity 
scales. 



3.1.3 Absorption line parameter distributions 

The H I column density and Doppler parameter distribu- 
tions resulting from fitting absorption lines to the spectra 
using AutoVP are shown in Fig. ^. The continuum offsets 
of Table |^ have been applied. For each model, effectively 
50 separate spectra were constructed from the simulation 
box corresponding to the identical spectral range analysed 
in Q1937— 1009. The profile analysis is computationally ex- 
pensive. For the range of column densities of interest, it is 
adequate to fit the spectra using Doppler profiles rather than 
full Voigt profiles. We find there is little difference in the re- 
sulting distributions, while a factor of 2 is saved in the com- 



puting time. Typically 2 — 3 x lO** features in total are found 
for each model, requiring an analysis time of ~ 1 — 2 x 10^ 
cpu seconds on a Compaq XP900 with a IGHz processor. 

The agreement in the distribution of A^'hi between the 
data and all the models is reasonably good, with the pre- 
diction of CHDM providing the best match, as is found in 
the wavelet analysis. The predicted b distributions for the re- 
maining models peak at substantially lower values than does 
the measured distribution. For both the A^hi and b distribu- 
tions, we find that the probability distribution for N^^^d-KS, 
where A'^ is the number of absorption lines, is narrower than 
the theoretical distribution, so that the rejection probabili- 
ties provided by the KS test below are perhaps conservative. 
We discuss this point further in Appendix ^ 

The cumulative distributions are shown in Fig. ^. The 
measured A''hi distribution is best matched by the CHDM 
model: the maximum difference in the measured and pre- 
dicted cumulative distributions is dKS = 0.083, with the 
associated formal KS acceptance probability Pks = 0.002. 
The next best model is tCDM, with ciKS = 0.10 and Pks ~ 
4 X 10~^. The CHDM model also provides the best match to 
the measured b distribution, with dKS = 0.071 and Pks = 
0.013. (The remaining models are very strongly rejected; the 
next best is tCDM with ^ks = 0.16 and Pks = 4 x 10"^^) 
It is clear that all the models other than CHDM predict 
far too narrow lines. The ACDMh model, which yields the 
largest fo- values of those models for which the predicted pixel 
flux distribution is consistent with the measured, predicts a 
median Doppler parameter of bmcd ~ 20.9 kms~^ . The mea- 
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Figure 6. Comparison between the measured absorption line pa- 
rameter cumulative distributions of Q1937— 1009 and the model 
predictions using AutoVP, allowing for an offset in the contin- 
uum level. The predicted A'^jjl distributions tighten about the 
measured distribution. The CHDM model agrees best with the 
measured A'^hi and b distributions. 

sured median b is 26.4 ± 0.9kms~^Q too large by 6a, but 
consistent with famed ~ 25.5 km s~^ predicted by the CHDM 
model. 

The narrowness of the absorption lines is reflected by 
the larger number of systems found in the AutoVP analyses 
of the simulation results: more lines are required to recover 
the mean measured optical depth of the spectrum. The Au- 
toVP analysis of the spectrum of Q1937— 1009 yields a total 
of 495 absorption features. The expected numbers of lines 
predicted by the models are for CHDM: 543; ACDMl: 592; 
ACDMh: 597; OCDM: 571; SCDM: 618, and tCDM: 585, 
all of which except CHDM exceed the measured number by 
at least 3a, assuming Poisson fluctuations. 

The effect of the continuum offset is small in compar- 
ison with the variances between the predicted and mea- 
sured A^Hi and b distributions. For instance, for the ACDMl 
model, which provides the best fit to the pixel flux distri- 
bution without a continuum offset correction, the median 
Doppler parameter is 20.2 km s~^ with no offset correction 
and 19.4 km s~^ with the correction. Both values are incon- 
sistent with the measured median 6- value by over 6a. The 

* The error in the median is (0.57r/Af)^/^o', where a is the stan- 
dard deviation of the distribution of fe- values, N is the number of 
lines, and assuming the distribution of fe-values is a guassian. The 
data and the simulations show a long tail toward high values, and 
so the error estimate should be modified accordingly (Kendall & 
Stuart 1969). We find that a lognormal distribution provides an 
acceptable fit to the distribution of 6-values derived by AutoVP. 
Using the lognormal distribution instead of a gaussian changes 
the error on the median by less than one per cent. 



Figure 7. Comparison between the measured absorption line pa- 
rameter cumulative distributions of Q1937— 1009 and the model 
predictions using AutoVP, for subsamples of lines optically thin 
(tq < 1) and optically thick (tq > 1) at Ly« line-centre. The 
agreement between the model predictions and the data is best for 
the optically thick systems. All the models predict much too low 
median 6- values for the optically thin systems. 



KS test formally strongly rejects the A^hi and b distribu- 
tions, compared with the measured distributions, for both 
cases. 

Absorption features with a line-centre optical depth 
To < 1 correspond to structures that are underdense at 
z — 3, while higher optical depth systems are associated 
with overdense gas (Zhang et al. 1998). Underdense gas at 
these redshifts is too rarefied to maintain thermal equilib- 
rium between photoionization heating and radiative losses, 
and so will retain a partial memory of its thermal history 
(Meiksin 1994). Overdense gas is more readily able to main- 
tain thermal equilibrium. Because of this physical difference, 
we split the line samples into two subsamples at ro = 1 to 
determine if the disagreements between the measured and 
predicted A'hi and 6 distributions may be due to an incor- 
rect thermal history rather than an incorrect cosmological 
model. The effect of this split is shown in Fig. 0. All of the 
models produce A'hi distributions for ro > 1 consistent with 
the data. The best case is CHDM, with (Iks = 0.10 and 
Pks = 0.041. The worst cases are the SCDM and tCDM 
models, both with (Iks = 0.14 and Pks = 0.002, while the 
ACDMl and ACDMh models give, respectively, (Iks = 0.12, 
Pks = 0.014 and dKS = 0.11, Pks = 0.018. Only the tCDM 
model is able to reproduce the b distribution for ro > 1 
(dKS = 0.06, Pks = 0.50). All the models predict a number 
of absorption systems consistent with the measured number 
of 179 (CHDM: 149; ACDMl: 205; ACDMh: 184; OCDM: 
214; SCDM: 206; tCDM: 161). 

The disagreement between the model predictions and 
measured distributions for the ro < 1 systems is more se- 
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Figure 8. Comparison between the measured absorption line pa- 
rameter cumulative distributions of Q1937— 1009 and the model 
predictions using SPECFIT, allowing for an offset in the contin- 
uum level. 



vere. None of the models is able to reproduce the measured 
h distribution. While the CHDM and tCDM models pre- 
dict the correct shape for the A^'hi distribution (both models 
give dKS = 0.06 and Pks = 0.2), they predict far too great 
a number of optically thin (ro < 1) lines compared with 
the measured number of 316, as do all the models except 
OCDM (CHDM: 395; ACDMl: 388; ACDMh: 413; OCDM: 
357; SCDM: 414; tCDM: 426). Thus it appears the discrep- 
ancy between the model predictions and measured distribu- 
tions arises primarily from the optically thin systems, al- 
though there is still a pronounced tendency for the models 
that agree best with the pixel flux distribution to predict too 
low &-values even for the optically thick (ro > 1) absorbers. 

We note that the measured median Doppler parame- 
ter for the optically thin systems, bmod = 28.6 ± 1.3 kms~^ , 
is signiflcantly higher than for the optically thick systems, 
6med ~ 25.6 ± l.Okms"^ . This is opposite the trend of a 
decreasing envelope of b-values for optically thin systems 
found at lower redshift (Hu et al. 1995; Kirkman & Tytler 
1997). It is also opposite the trend found in the simu- 
lations. The model predictions for femcd among the opti- 
cally thick systems are for CHDM: 27.7kms"^; ACDMl: 
20.9 kms~^; ACDMh: 23.1kms~^; OCDM: 18.8kms-^; 
SCDM: 20.4kms"^; tCDM: 25.3kms-^ Of these, only 
the CHDM, ACDMh, and tCDM models are consistent 
with the measured value. The predictions for bmed for the 
optically thin systems, however, are substantially smaller 
CHDM: 24.6kms-^; ACDMl: 18.4kms-^ ; ACDMh 
19.6kms~^; OCDM: 18.2kms-^; SCDM: 18.5kms"^ 
tCDM: 22.1 kms"^ . Only the CHDM model is able to match 
the high measured value within 3(t. We return to a discus- 
sion of this discrepancy in 



4.4 below. 



To test the sensitivity of the distributions of absorp- 



tion line parameters to the line-analysis method, we per- 
form a second set of analyses using SPECFIT. The results 
are shown in Fig. ^. The adopted threshold optical depth for 
finding lines is Tmin = 0.15, although smaller optical depth 
lines may result from the fits. (It would have been possi- 
ble to use a smaller value for Tmin for most of the models; 
the value adopted was required by the CHDM and tCDM 
models to avoid producing an excessive number of lines. The 
same value was adopted for the analysis of all the models 
for consistency.) The best fits to the measured A'hi distribu- 
tion are provided by the ACDMl and SCDM models, allow- 
ing for continuum offset corrections. The predicted CHDM 
and tCDM A''hi distributions do most poorly, both being 
significantly steeper than the measured distribution. The 
KS test apphed to the ACDMl and SCDM resuhs gives, 
respectively, rixs ~ 0.046, Pks = 0.16 and dys, ~ 0.056, 
Pks = 0.043. The remaining models do significantly more 
poorly, giving for CHDM: ^ks = 0.086, Pks = 2 x 10"*; 
ACDMh: ^ks = 0.073, Pks = 0.003; OCDM: c^ks = 0.073, 
Pks = 0.003; tCDM: ^ks = 0.092, Pks = 6 x 10"^ In con- 
trast to the AutoVP analysis, for which the CHDM model 
provides a marginally acceptable fit, the A'hi distribution 
derived for CHDM using SPECFIT provides a poor match 
to the data. By contrast, while the SPECFIT distributions 
for the ACDMl and SCDM models are acceptable, the Au- 
toVP distributions for these models are strongly rejected. 

As was found in the AutoVP analysis, the models predict 
too low fe-values: none of the models produces a h distribu- 
tion consistent with the measured distribution. The mea- 
sured median &- value is 24.2 ± 1.8kms~^ , where the large 
error is due to the wide dispersion in 6- values. All the model 
predictions are smaller, but only that of OCDM by more 
than 3(7. The SPECFIT analysis generally shows good con- 
sistency in the predicted number of absorption features. The 
analysis of the spectrum of Q1937— 1009 yields 609 absorp- 
tion features. The expected numbers of lines predicted by 
the models are for CHDM: 658; ACDMl: 527; ACDMh: 
596; OCDM: 462; SCDM: 557, and tCDM: 661. Allowing for 
Poisson fiuctuations, all the model predictions are consistent 
with the measured number except for ACDMl and OCDM, 
which are too low. Thus, although the ACDMl model pre- 
dicts the correct shape for the A'hi distribution, it predicts 
an incorrect normalization. 

The differences in the AutoVP and SPECFIT results may 
be due in part to the different relative emphasis the algo- 
rithms give to optically thin and optically thick systems. 
Splitting the SPECFIT results at tq = 1 shows that while all 
the models predict a number of lines with ro > 1 consistent 
(at the 3(7 level) with the measured number of 178 (CHDM: 
165; ACDMl: 192; ACDMh: 185; OCDM: 189; SCDM: 194, 
and tCDM: 170), only the CHDM, ACDMh and tCDM mod- 
els predict a number of optically thin lines consistent with 
the measured number of 431. (These models predict 493, 412 
and 491 lines, respectively.) The OCDM and ACDMl pre- 
dictions lie over 3a too low, while the CHDM and tCDM pre- 
dictions are marginally consistent with the 3ct upper limit. 
This contrasts with the AutoVP results, for which all the 
models predicted too many optically thin systems. The two 
algorithms also yield differing levels of acceptability of the 
shapes of the A'hi distributions for the optically thin sys- 
tems. While the AutoVP distributions agree well with the 
data for the CHDM and tCDM models, the SPECFIT results 
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are inconsistent with the data for all the models, except for 
the marginal case of tCDM (dKS = 0.09, Pks = 0.001). 

The results of AutoVP and SPECFIT agree more closely 
for the optically thick systems, although still with some dis- 
agreement. While the AutoVP analysis yields acceptable A'hi 
distributions for all the models (although the results for the 
SCDM and tCDM models are marginal), the SPECFIT anal- 
ysis yields distributions that are rejected for all the mod- 
els except CHDM (^ks = 0.08, Pks = 0.01), ahhough the 
ACDMl and OCDM predictions are marginal (both give 
rfKS = 0.14 and Pks ~ 0.001). Thus the acceptability of the 
predicted A'^ei distributions is somewhat sensitive to the line- 
fitting algorithm used. This is due in part to the marginality 
of the rejection levels: a small change in the distributions 
pushes the level of agreement above or below the 3cr rejec- 
tion threshold. Both algorithms nevertheless agree that the 
CHDM model provides the best match to the measured A'hi 
distribution for the optically thick systems. 

No significant difference is found in the median mea- 
sured 6- values for the optically thin and thick systems using 
SPECFIT. For to < 1, &med = 24.5 ± 2.3 km s"^ ; for tq > 1, 
fcmed = 24.1 ± 2.2kms~^ . Although all the models predict 
lower values for the median 6- value than measured, for both 
To < 1 and To > 1 systems, the differences are not highly 
significant given the large errors in the measured values. A 
stronger statistical statement is made using the full sam- 
ple. We note that the models predict that the optically thin 
systems will have smaller median Doppler parameters than 
the optically thick systems, but the differences are small, 
amounting to only l-2kms~^ . 

Just as is found for the full sample, the predicted b 
distributions for both the tq < 1 and tq > 1 subsamples are 
rejected by the KS test for all the models. The only exception 
is the marginal agreement found for the CHDM model for 
the optically thin systems (dKS = 0.13, Pks = 0.004). The 
SPECFIT analysis consistently finds that the models predict 
too low 6- values compared with the data. This contrasts with 
the AutoVP analysis, for which the b distribution predicted 
by tCDM for optically thick systems provides an excellent 
match to the data (while all other model predictions are 
rejected). 

It appears difficult to reach any definite conclusion re- 
garding the goodness-of-fit of the predicted Doppler param- 
eter distributions to the data. It is possible that AutoVP and 
SPECFIT cue off low optical depth fluctuations differently. 
The main lesson is likely that there is a limit to the physical 
interpretation of gas temperature or velocity dispersion that 
may be attached to the fit Doppler parameters. 

3.2 HS 1946-f7658 

Recent CMB experiments strongly favour a flat, or nearly 
flat, universe (Netterfleld et al. 2001; Pryke et al. 2001; 
Stompor et al. 2001). From hereon, we restrict the consid- 
eration of models to those currently most viable: CHDM, 
the two ACDM models, and tCDM. Ahhough the tCDM 
model is perhaps already excluded by the CMB data, it pro- 
vides useful comparisons with the predictions of the CHDM 
model and illustrates the degree to which simulations of the 
Lya forest may (or may not) be used to discriminate be- 
tween cosmological models based on the tests considered in 
this paper. 




flux 



Figure 9. Comparison between the measured flux distribution 
of HS 1946+7658 and the model predictions. A continuum ofli'set 
correction is applied to each model to enforce matching the cu- 
mulative distributions at flux values near unity. The dotted curve 
shows the measured cumulative flux distribution after removing 
pixels contaminated by identified metal lines. The best matches 
are obtained by the ACDMl and ACDMh models, with predicted 
distributions that bracket the measured one. 



3.2.1 Flux distribution 

Over the redshift range 2.5 < z < 3.0, the mean optical 
depth in the spectrum of HS 1946-1-7658 is found to be fa ~ 
0.24. The adopted optical depth rescalings and conitnuum 
offset corrections for these models are provided in Table ^ 
No additional evolution in the UV background was required 
{p = 0). 

The predicted and measured distributions of flux per 
pixel (excluding bad pixels and the region occupied by a 
Damped Lya Absorber for the measured distribution), are 
shown in Fig. |^. Also shown is the measured flux per pixel 
distribution after removing pixels contaminated by metal 
lines identified by Kirkman & Tytler (1997). (We note that 
fa is negligibly affected by the removal of these regions.) The 
best matches to the measured metal-free distribution are 
provided by the ACDMl and ACDMh models, with the re- 
spective maximum cumulative distribution differences from 
that measured of c^ks = 0.047 and c^ks ~ 0.039. Both mod- 
els are strongly ruled out by the formal KS test probability. 
Allowing for a reduction in the effective number of pixels by 
a factor 3-5 (Appendix ^ still yields unacceptable prob- 
abilities for the predictions. The two model distributions, 
however, bracket the measured distribution, so it is reason- 
able to expect a ACDM model with an intermediate value 
of aj would provide an acceptable fit. 
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Figure 10. Comparison between the measured wavelet coefficient 
cumulative distributions of HS 1946+7658 (solid curves) and the 
model predictions extrapolating about the simulation data at z = 
3 (dashed curves) and interpolating between the simulation data 
at 2 = 2 and 2 = 3 (dotted curves), allowing for continuum offset 
corrections in both. (For ACDMji, the dashed curves correspond 
to interpolation between the simulation data at 2 = 2.7 and z = 
3.) The curves from left to right correspond to the velocity scales 
2-4, 4-8, 8-16, 16-32, 32-64, and 64-128 kms"! . The best match 
for the velocity scale 16-32 km s~^ , corresponding to the median 
measured Doppler parameter of the absorption features, is found 
for the CHDM and tCDM models. All the models underpredict 
the amount of velocity structure on smaller scales. 



3.2.2 Wavelet coejjicient distributions 

The wavelet coefficient distributions are shown in Fig. |lo[ 
All the models poorly match the velocity structure on scales 
of 8-16 km s~^ and smaller, indicating the presence of sub- 
stantially more structure in the data on these scales than 
predicted by the models. This structure reflects at least in 
part the presence of metal lines. 

Because the IGM temperature decreases with time in 
underdense regions, within which the more abundant opti- 
cally thin features arise, it is unclear whether the best strat- 
egy for generating the distributions is by interpolating be- 
teen the simulation results at 2 = 2 and z = 3 or by extrapo- 
lating the optical depths from the z — 3 results alone, as the 
spectral range analyzed lies nearer z = 3 than z = 2. The 
distributions for both approaches are shown in Fig. The 
results of both methods are qualitatively similar. (We note 
that the two approaches produce negligible differences in the 
flux distributions.) Neither approach yields a good match at 
low velocity scales. On the scale 16-32 km s~^ , correspond- 
ing to the median measured Doppler parameter (see below) , 
acceptable agreement is found for the CHDM results based 
only on interpolation (rixs ~ 0.042, Pks ~ 0.026), while 
agreement for tCDM is found only for extrapolating from 



Figure 11. Comparison between the measured absorption line 
parameter distributions of HS 1946+7658 (solid curves) and the 
model predictions. While good agreement is generally found for 
the A^Hi distributions above lO'^^ cm~^ , all the models underpre- 
dict the number of lower column density systems. The CHDM 
model best reproduces the measured b distribution. The agree- 
ments improve when metal absorption systems identified in the 
observed spectrum are removed (dotted curves). 

z = 3 {(Iks = 0.045, Pks ~ 0.015). The agreement tends to 
improve for all the models on larger velocity scales. 

3.2.3 Absorption line parameter distributions 

All the models underpredict the number of low H i column 
density systems and low fo- value systems, as shown in Fig. |ll]. 
It is possible that the low A'hi and low 6-value systems in 
the measured spectrum are dominated by metal absorption 
features. Kirkman & Tytler (1997) identify a large number 
of metal lines in the redshift range analysed. Unfortunately, 
because these lines are frequently blended with Lya forest 
systems, it is not possible to match them on a one-to-one 
basis with the lines produced by AutoVP. In an attempt to 
account for the metal absorbers, we remove a narrow line 
(b < 15kms~^) from our list when it matches against a 
metal absorber in the list of Kirkman & Tytler (1997) to 
within a tolerance of 2A in the (observed) line-centre wave- 
length. This results in the rejection of 28 systems. The re- 
sulting cumulative distributions, shown by the dotted curves 
in Fig. |l^, match the predicted more closely. Formally, all 
the A''hi distributions are acceptable at the 3cr level by the 
KS test. The best matches are provided by the ACDMl 
and ACDMh models, which give for ACDMl: ^ks = 0.063, 
Pks = 0.057; and for ACDMh: dKS = 0.064, Pks = 0.053. 
Similar results are obtained either by interpolating between 
the z — 2 and z = 3 simulation results, or extrapolating 
from the z = 3 results alone. Only the CHDM model yields 
a b distribution consistent with the data. The KS test gives 
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Figure 12. Comparison between the measured absorption line 
parameter distributions of HS 1946+7658 and the model predic- 
tions, for systems with tq < 1 and tq > 1. The predicted dis- 
tributions agree better with the measured distributions for the 
optically thick systems than for the optically thin. 



c^Ks = 0.065, Pks = 0.048, extrapolating from the z = ?, 
simulation results. An inconsistent distribution results when 
based on interpolation between the z = 2 and 2 = 3 results 
because of the lower b- values at 2: = 2. 

As was the case for Q1937— 1009, the models that 
provide the best fits the flux distribution (ACDMl and 
ACDMh ) predict too small median Doppler parameters. 
The measured median h (after excluding metal lines) is 
24.9 ± 0.8 km s"^ , while ACDMl and ACDMh predict, re- 
spectively, 18.8kms~^and 19.2kms~^, both too small by 
over 7(T. 

The AutoVP analysis of the spectrum of HS 1946-1-7658 
yields 443 absorption lines. The predicted numbers for the 
various models are for CHDM: 508; ACDMl: 539; ACDMh: 
565; and tCDM: 539, all greatly in excess of the measured 
number except for CHDM. The larger predicted numbers 
are consistent with the general narrowness of the features. 

As for Q1937+1009, we split the line samples into two 
subsamples, optically thin (ro < 1) and optically thick 
(ro > 1) at line-centre. The resulting A^hi and h distributions 
(with identified metal lines removed as above) are shown 
in Fig. ^ Excellent agreement is found between the model 
predictions for the A^hi distribution and the measured distri- 
bution for the optically thick systems. Acceptable agreement 
(at the 3(T level) is found for the h distributions as well, ex- 
cept for the ACDMl model, which is marginally rejected by 
the KS test. The agreement for the optically thin systems 
is much poorer. Only the ACDM models yield acceptable 
A'hi distributions, while only the CHDM model predicts a 
6 distribution consistent with the measured. The measured 
6 distribution disagrees most strongly with the CHDM pre- 
diction at fe < 15kms~^ . Although we match 28 metal lines 



with the list of Kirkman & Tytler, they report 45 in the red- 
shift range of interest, which could potentially account for 
the difference. We note that these CHDM results are based 
on extrapolating from the 2 = 3 simulation results; the dis- 
tribution derived by interpolating between the 2 = 2 and 
2 = 3 results yields unacceptably small b-values, as for the 
other models. 

The AutoVP analysis of the spectrum of HS 1946-1-7658 
yields 79 absorption systems with ro > 1 and 364 with 
To < 1. The predicted numbers of optically thick features 
for the various models are for CHDM: 68; ACDMl: 102; 
ACDMh: 92; and tCDM: 76, all consistent with the mea- 
sured number within Poisson fiuctuations. The predicted 
numbers of optically thin features for the models are for 
CHDM: 440; ACDMl: 437; ACDMh: 473; and tCDM: 463. 
All the model predictions greatly exceed the number of mea- 
sured lines. 



3.3 Q0014+813, Q0302-003, Q0636+680, 
Q0956+122 

3.3.1 Flux distribution 

The analysis redshift ranges for each of the Hu et al. (1995) 
QSO spectra are given in Table |^. The mean optical depth 
found for each is: Q0014-I-813: r„ = 0.30; Q0302-003: 
fc = 0.31; Q0636-f 680: = 0.29; Q0956-H22: = 0.25. It 
was found unnecessary to include any amount of additional 
evolution in the UV radiation background. The adopted op- 
tical depth rescalings and continuum offsets are given in Ta- 
ble|. 

The distributions of fiux per pixel are shown in Fig. hSl 
The best-fitting model is ACDMl in all cases except 
Q0956+122, for which it is ACDMh. Formally, all the mod- 
els are rejected by the KS test. Allowing for a reduction 
in the efi'ective number of pixels by a factor of 3-5 due 
to correlations between neighbouring pixel values yields ac- 
ceptable agreement. The maximum differences between the 
measured cumulative distributions and those predicted by 
the ACDMl model, and the associated probabilities, are for 
Q0014-f 813: dKS = 0.031, Pks = 0.01 - 0.09; Q0302-003: 
dKS = 0.030, Pks = 0.02 - 0.11; and Q0636+690: ^ks = 
0.024, Pks = 0.09 - 0.31. The ACDMh model gives for 
Q0956-I-122: dKS = 0.035, Pks = 0.003-0.04. As was found 
for Q1937-1009 and HS 1946+7658 above, the ACDMl and 
ACDMh model predictions tend to bracket the measured 
distributions. The CHDM and tCDM models are strongly 
rejected by all the QSO spectra, even allowing for possible 
correlations in the pixel flux values. 



3.3.2 Wavelet coejficient distributions 

As for HS 1946+7658, the predicted model distributions for 
the wavelet coefficients are computed by extrapolating about 
the simulation results at 2 = 3, as this is closest to the red- 
shift ranges of the spectra. The values for a from Table ^ are 
used in the extrapolation. The resulting wavelet coefficient 
distributions are displayed in Fig. |l^. The agreement be- 
tween the model predictions and the data is poor at the pixel 
scale (4-8kms~^ ). As was found in the previous QSO spec- 
tra, the data show a greater amount of structure. The data 
continue to show more structure on the 8-16 kms~^ scale 
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Figure 13. Comparison between the measured flux distributions 
of tfio Hu et al. (1995) QSO spectra and the model predictions. 
A continuum offset was applied to each model to enforce match- 
ing the predicted and measured cumulative distributions at flux 
values near unity. For all spectra except Q0956-I-122, the ACDMl 
model provides the best match. For Q0956-I-122. the best match 
is given by the ACDMjj model. 

as well. Since the spectra resolve features on these scales, 
the differences appear to be physical. At the next level 
(16-32 kms-i ) the trend reverses: the models predict more 
structure than found in the data. The best agreement is 
consistently found for the CHDM model. The agreement, 
hovirever, is statistically acceptable only for Q0636+680 and 
Q0956+122, according to the KS test. (For Q0636+680, 
dKS = 0.018, fks = 0.88, and for Q0956+122, dKS = 0.035, 
Pyjs, = 0.17.) The level of agreement shows significant scat- 
ter. The CHDM model predictions at 16-32 km s"'^ agree 
well with the data for Q0636+680 and Q0956-I-122, but not 
for the remaining two QSOs. The CHDM model performs 
acceptably well on larger scales for all the QSOs. 

3.3.3 Absorption line parameter distributions 

The H I column density and Doppler parameter distribu- 
tions derived by the AutoVP analyses of the QSO spectra of 
Hu et al. (1995) are shown in Figs ^ and ^ along with 
the model predictions. All the models predict an H i column 
density distribution and Doppler parameter distribution 
that both peak at lower values than measured. Of the four 
QSO spectra, the A'hi distribution for only Q0636-I-680 is 
well-represented by any of the models. The prediction of 
CHDM, the best-fitting model, for the A^hi distribution of 
Q0636+680 gives ^ks = 0.11 and Pks = 0.004, while all 
other models are rejected. For all the remaining QSO spec- 
tra, CHDM provides the best match to the measured Ahi 
distribution as well, although the predicted distributions are 
rejected by the KS test. 



Figure 14. Comparison between the measured wavelet coefficient 
cumulative distributions of the Hu et al. (1995) QSO spectra 
(heavy solid curves) and the model predictions, allowing for con- 
tinuum offset corrections. The curves from left to right correspond 
to the approximate velocity scales 4—8, 8-16, 16—32, 32-64, and 
64— 128 km s~^. Model predictions are shown for CHDM (long- 
dash), ACDMl (short dash), ACDMh (dot - short dash), and 
tCDM (short dash — long dash). The best match for the veloc- 
ity scale 16-32 km s~^, corresponding to the median measured 
Doppler parameters, is found for the CHDM model in all cases. 
All the models underpredict the amount of structure on smaller 
velocity scales. 

None of the models provides an acceptable match to 
the measured b distributions, although the CHDM model 
consistently comes closest. As found for Q1937— 1009 and 
HS 1946-^7658, the model (ACDMl or ACDMh ) which best 
agrees with the measured flux per pixel distribution pro- 
duces Doppler parameters that are too small. The measured 
median Doppler parameters (after removing metal lines, as 
discussed below), for Q0014-I-813, Q0302-003, Q0636-f680, 
and Q0956-hl22 are, respectively, 30.5 ± 1.2 kms"^ , 29.5 ± 
1.2kms-i , 26.1 ± 1.1 kms'^ , and 28.4 ± 1.3 kms'^ . The 
predictions of ACDMl range between 17-18 km s~^ for all 
the QSOs, and for ACDMh the prediction is 19kms~^ , all 
too small by 35-75 per cent and by at least 6a. 

The AutoVP analysis of Q0636-)-680 produces 272 lines 
(after removing identified metal lines). The CHDM model 
predicts 334 lines, greatly in excess of the measured num- 
ber. Similar or larger discrepancies between the predicted 
and measured number of absorption lines are found for all 
the QSO spectra and models, generally reflecting the nar- 
rowness of the predicted absorption features compared with 
the measured. 

We note that Hu et al. (1995) report a signiflcantly 
larger number of narrow absorption features in the spec- 
trum of Q0636-I-680 than in any of the other spectra, and 
that most of these may be identified with metal lines (13 
lines in the redshift range analysed here) . They consequently 
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Figure 15. Comparison between the H I column density distri- 
butions measured using the Hu et al. (1995) QSO spectra and the 
model predictions. The curves are labelled as in Fig. |l^ All the 
models predict a steeper distribution than measured. The dotted 
curves show the distributions after the removal of identified metal 
lines. 
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Figure 16. Comparison between the Doppler parameter distri- 
butions measured using the Hu et al. (1995) QSO spectra and the 
model predictions. The curves are labelled as in Fig. All the 
models predict a distribution peaking at lower values than mea- 
sured. The dotted curves show the distributions after the removal 
of identified metal lines. 



exclude Q0636+680 from their statistics of the Lya forest. 
We similarly find a larger number of narrow lines [b < 
lOkms^^) in Q0636+680 compared with the other spec- 
tra. We find that, based on the KS test, the flux and A^hi 
distributions show no anomalous behaviour compared with 
the other three spectra. In fact, these distributions, along 
with the b distribution, agree better with the model predic- 
tions than the distributions found for any of the other QSO 
spectra. The amount of metal line contamination based on 
the tabulations in Hu et al. (1995) is small (5 per cent 
of the lines in Q0636-f680, 2 per cent in Q0956-I-122, no 
lines in Q0014-I-813 and 1 line in Q0302-003). Removing 
the narrow lines in the AutoVP analysis that most closely 
match the metal lines reported in Hu et al. (1995) has 
little effect on the A^hi and b distributions, as shown in 
Figs |l^ and Although some of the narrow absorption 
features remaining may still be unidentified metal lines, 
the similarity in the A^hi distributions between Q0636+680 
and the other three QSOs suggests it is unlikely the con- 
tamination is large, so that it would seem unlikely that 
unidentified metal lines account for the large difference be- 
tween the 6 distribution of Q0636-f680 and the other three 
QSOs. Moreover, there is no statistically signiflcant differ- 
ence between the median Doppler parameters of the opti- 
cally thin (6mcd = 25.8 ± 1.5kms~^) and optically thick 
(fomcd = 26.8 ± 1.4kms"^) systems in Q0636+680 (as is 
true for all the Hu et al. QSOs). If there were still sub- 
stantial metal contamination present, a significantly larger 
median 6-value for the optically thick systems may be ex- 
pected. Because the median 6-values of the optically thin 
and thick systems are consistent for each of the QSOs, the 
discrepancy between the predicted and measured Doppler 
parameters found may not be attributed predominantly to 
the optically thin systems, unlike the cases of Q1937— 1009 
and HS 1946-^7658. 



3.4 J2233 606 

3.4-1 Flux distribution 

The mean optical depth for J2233— 606 is fa — 0.16 over the 
analysis redshift interval 1.732 < z < 2.132. The rescaling 
parameters used are given in Table ^. The simulation results 
at only z — 2 are used to extrapolate over the analysis red- 
shift range, using the values for a from Table ^. It was found 
unnecessary to include any additional amount of evolution 
in the UV radiation background. 

The distributions of fiux per pixel are shown in Fig. |l^ 
All of the models differ greatly from the data; the ACDMl 
model is closest. Cristiani & D'Odorico (2000) have iden- 
tified a large number of metal absorption features in the 
spectrum. The dotted line in Fig. ^ shows the flux distri- 
bution after removing regions of the spectrum within 2A of 
an identifled metal line. The effect is to worsen the agree- 
ment with the models. The reason for the poor agreement is 
unclear. At the low redshift of the QSO (zem = 2.238), the 
fundamental mode across the simulation volume is begin- 
ning to become non-linear. As a check, we have computed 
the flux per pixel distributions by extrapolating the optical 
depths from the simulation output at z = 3 to the redshift 
range of the spectrum. Nearly identical fiux per pixel distri- 
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Figure 17. Comparison between the measured flux per pixel dis- 
tribution of J2233— 606 and the model predictions. A continuum 
offset was applied to each model to enforce matching the cumu- 
lative distributions at flux values near unity. None of the mod- 
els recover the measured flux distribution, although the ACDMl 
model is closest. The dotted line shows the measured flux distri- 
bution after removing regions contaminated by identified metal 
lines. 

butions result, so that the disagreement appears not to be 
due to a lack of convergence in the simulation. 

3.4-2 Wavelet coefficient distributions 

The wavelet coefficient distributions are shown in Fig. |l8| 
All the models poorly match the velocity structure on 
scales of 7-13 km s~^ and smaller. The best agreement on 
the 13-27 km s~^ scale, corresponding to the measured me- 
dian Doppler parameter (see below), is provided by the 
CHDM model, although the match is rejected by the KS 
test (c/ks = 0.09, Pks = 10"^). The models recover the 
measured behaviour on larger velocity scales only moder- 
ately well to poorly. 

3.4-3 Absorption line parameter distributions 

Despite the poor agreement with the measured flux dis- 
tribution, all the models well reproduce the A^hi distribu- 
tion, as shown in Fig. |l9| (Using the line list of Cristiani & 
D'Odorico (2000), we match 28 of the narrow lines found 
to identified metal lines and remove them.) The best agree- 
ment is given by the ACDMl model, with dKS ~ 0.078 and 
Pks = 0.20. The model predictions, however, disagree in the 
number of lines. The predicted numbers are for CHDM: 306, 
ACDMl : 304, ACDMh : 312, and tCDM: 327, aU greatly 
in excess of the measured number of 192. 

The model predictions for the b distributions agree 
poorly with that measured. The CHDM model agrees best. 



Figure 18. Comparison between the measured wavelet coeffi- 
cient cumulative distributions of J2233— 606 (solid curves) and 
the model predictions (dashed curves), allowing for continuum 
offset corrections. The curves from left to right correspond to 
the approximate velocity scales 4-7, 7-13, 13-27, 27-54, and 54— 
108 km s"'^ . The best match for the velocity scale 13— 27 km s~^ , 
corresponding to the median measured Doppler parameter, is 
found for the CHDM model. All the models underpredict the 
amount of velocity structure on smaller scales. 
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Figure 19. Comparison between the measured absorption line 
parameter distributions of J2233— 606 and the model predictions. 
The dotted lines show the measured distributions once identified 
metal lines have been removed. 
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with dKS = 0.13 and Pks = 0.004. The predicted dis- 
tributions of the remaining models are all strongly re- 
jected. As was found for all the previous QSO spectra, the 
model (ACDMl ) which provides the best match to the 
flux per pixel distribution underpredicts the measured me- 
dian Doppler parameter. The measured median is 24.7 ± 
1.2kms~^, while ACDMl predicts 17.2 kms~^, too small 
by 6a. 

Splitting the sample into optically thin and optically 
thick systems tends to worsen the agreement for the A^hi 
predicted distributions of the optically thick absorbers to 
the point that they are only marginally acceptable (the best 
cases are CHDM and ACDMl, both with ^ks = 0.25 and 
Pks = 0.006). The predicted Ahi distributions for the opti- 
cally thin systems, however, all agree well with the measured 
(optically thin) distribution. Only the CHDM and tCDM 
models predict b distributions consistent with the measured 
for the optically thick absorbers (the tCDM prediction is 
marginal), while only the CHDM model predicts a b distri- 
bution consistent with the measured for the optically thin 
systems. The agreement for CHDM tends to improve for 
both the To < 1 and tq > 1 subsamples compared with the 
full sample. The number of optically thick lines predicted by 
CHDM is 60, agreeing well with the measured number of 48. 
The number of optically thin lines, however, agrees poorly: 
the predicted number is 246, while only 144 are measured. 



4 DISCUSSION 

In this section we consider the possible systematics and lim- 
itations of each of the tests employed, and discuss what may 
still be concluded from the comparison between the models 
and the data. No single cosmological model clearly emerges 
as the best based on the full set of statistical tests. The mea- 
sured distributions of flux per pixel are the most accurately 
reproduced, with the agreement reaching high precision. The 
agreement between the predicted and measured wavelet co- 
efficients, however, is substantially poorer. The data gener- 
ally show considerably less structure than the models predict 
on the velocity scale of approximately 15-30 km s~^ . While 
the agreement often improves on larger velocity scales, the 
best-fitting models tend to be those that provide poor fits 
to the measured pixel flux distributions. The Voigt absorp- 
tion line decompositions of the spectra similarly display a 
conflict: most of the models predict significantly narrower 
absorption features than measured, with the best agreement 
obtained by those models which agree most poorly with the 
measured fiux per pixel distributions. 

We show below that the shape of the flux per pixel dis- 
tribution is sensitive to the amount of power in the models 
on small scales. The absorption properties of the spectra 
tend to be dominated by features that are marginally op- 
tically thick at line-centre (13.5 < logj^Q A^hi < 14), which 
are reproduced in the simulations by structures with fila- 
mentary morphologies (Miralda-Escude et al. 1996; Zhang 
et al. 1998). The thicknesses of the filaments correspond 
to the scale height expected for photoionized gas in hy- 
drostatic equilibrium within the gravitational potential of 
structures with a moderate cosmological overdensity (Zhang 
et al. 1998). For small density contrasts, this scale height 
is essentially the cosmological Jeans length of the gas. We 



shall show that the fiux distributions produced by the 
absorbers vary systematically with the amount of power 
in the linear power spectra on these scales. In the lin- 
ear theory of the gravitational growth of density fiuctua- 
tions including baryons, the Fourier density modes of the 
baryons are suppressed by the factor 1/[1 -I- (fc/fcj)^], where 
fcj — {AnGpa^Y^'^ /cs is the (comoving) cosmological Jeans 
wavenumber for a total cosmological density p and bary- 
onic sound speed Cs, and a is the expansion factor (Pee- 
bles 1984, 1993). (The corresponding proper Jeans length is 
Lj = 2-Ka/kj = 2n{2/3y^^Cs/H(z), where H{z) is the Hub- 
ble constant at redshift z.) It is therefore useful to define the 
density fluctuations crj filtered on the scale kj according to 

"'^X '^°^' [l + (fc/fc.T)T ' 

where A^(fc) is the dimensionless power spectrum 
P{k)/2iT^ at the scale k. For simplicity, we define aj 
at z — 3, choosing Cs to be the isothermal sound speed 
16.8 kms~^ for fully ionized gas at a temperature 2 x lO** K , 
the estimated temperature for gas with an overdensity near 
unity at z = 3 (McDonald et al. 2000a; Ricotti, Gnedin & 
ShuU 2000; Schaye et al. 2000). 



4.1 Flux distribution 

The simulations are able to reproduce the measured fiux 
per pixel distributions to extremely high accuracy. The best 
case is given by the SCDM model for Q1937-1009. The 
maximum difference between the measured and predicted 
cumulative flux distributions is dKS ~ 0.016, close to the 
level of precision of the simulation (see Appendix For 
HS 1946-1-7658, the ACDMh model predicted distribution 
agrees with the measured to (Iks < 0.04. Similar levels of 
agreement are found for the other QSO spectra examined, 
except for J2233— 606 which appears anomalous. 

Despite the close agreement between the predicted and 
measured flux distributions, the formal statistical accept- 
ability of the matches, as given by the KS test, is gener- 
ally poor. This is in part due to the large number of pix- 
els involved (approximately lO**), which permits a stringent 
comparison to be made. It is, however, also due to devi- 
ations from the theoretical KS probability distribution of 
dKS. In practice, the probability distribution is broader than 
the theoretical, which is likely a consequence of correlations 
between neighbouring pixel fiux values (see Appendix 
Allowing for the correlations to decrease the effective num- 
ber of degrees-of-freedom yields acceptable agreement for 
the predicted distributions of either the ACDMl or ACDMh 
model (or both) for aU the QSOs (except HS 1946-1-7648, al- 
though the predicted distributions of the two ACDM models 
bracket the measured distribution, and J2233-I-606), while 
the predictions of the CHDM and tCDM models are con- 
sistently rejected. We note that Ranch et al. (1997) and 
McDonald et al. (2000b) also found very close agreement be- 
tween the predicted flux distribution of a ACDM model simi- 
lar to our ACDMh model and the measured flux distribution 
in a set of QSO spectra observed with the Keck HIRES. In 
contrast to our results, however. Ranch et al. found the pre- 
dicted flux distribution of an SCDM model nearly identical 
to ours to agree less well with the measured. 
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The flux distributions are found to vary systematically 
with CTj , the density fluctuations on the scale of the cosmo- 
logical Jeans length. As shown in Fig. ^, as crj increases, 
the distributions flatten (as measured between flux values 
of 0.2 and 0.8). This is simflar to the trend found by MOO 
of increasing width of the probability distribution of log r 
with increasing small-scale power. The best-fltting predicted 
flux distribution for Q1937-1009 is provided by the SCDM 
model, for which crj = 1.6. Although the predicted flux dis- 
tributions of the ACDMl and ACDMh models agree less 
well, their cumulative flux distributions bracket the mea- 
sured distribution, as is found for all the QSO spectra ex- 
cept J2233+606. The values of crj for these models (1.7 and 
1.3, respectively), similarly bracket the value of aj for the 
SCDM model, so that it is reasonable to expect a ACDM 
model with an intermediate value of crj would provide a bet- 
ter flt. 

The success of the ACDM models in predicting pixel 
flux distributions that bracket the measured distributions 
suggests a strong bound on the small scale power of the 
primordial power spectrum at z = 3 of 1.3 < crj < 1.7 
(with greater than 3cr certainty). This result combined 
with limits on crg^-i from cluster abundances (Pierpaoli 
et al. 2001), on the primordial power spectrum shape pa- 
rameter (0.19 < Fps < 0.37; Eisenstein & Zaldarriaga 
2000) and its amplitude (Bunn & White 1997) and spec- 
tral index (0.9 < n < 1.1; Netterfield et al. 2001; Pryke 
et al. 2001; Stompor et al. 2001), imposes the limit on 
the cosmological matter density 0.26 < Q,m < 0.43 for a 
flat universe (SIa = 1 — ^m), and assuming Q^h^ ~ 0.020 
(O'Meara et al. 2000), 0.45 < h < 0.85, and no mas- 
sive neutrino species. These restrictions ease somewhat to 
0.26 < ^Im < 0.68 if up to two massive neutrino species 
are allowed accounting for a total mass fraction < 0.2, 
and the 3a limits of Pierpaoli et al. (2001) are permit- 
ted. These limits are in good agreement with those found 
elsewhere (eg, Netterfleld et al. 2001; Pryke et al. 2001; 
Stompor et al. 2001). Adopting Fps — 0.15 in accordance 
with Croft et al. (2000), based on the flux power spectrum 
Ppik) measured in several Lya forest spectra, gives (for 
1.3 <aj < 1.7) the limits 0.23 < Qm < 0.45 (^a = I-^a/, 
Sli/ =0), in satisfactory agreement with the limits of Croft 
et al. (2000). 

4.2 Evolution of F 

Since the gas is in photoionization equilibrium, the optical 
depth in Lya is predicted by the models only to within the 
factor feion (§ The re-normalizations of the optical depths 
required for a given model to match the measured mean op- 
tical depth in Lya of a QSO spectrum provides an estimate 
of the value of F required. 

In Figure ^ we show the required values of F for the 
models to match the measured mean optical depth in 
each QSO spectrum. The width of a given horizontal bar 
indicates the redshift range analysed in the QSO spectrum, 
and the corresponding point marks the required value of F at 
the central redshift. Two estimates are made. For the flrst, 
we show the values of F assuming the value for Qb adopted in 
each model. For the second, we show the values of F assum- 
ing Qbh'^ = 0.020 (O'Meara et al. 2000) after rescaling the 
optical depths by bion. The scatter is considerably reduced 




2.5 3 3.5 4 4.5 5 



1 + z 

Figure 20. Required H I photoionization rate F required for 
the models to match the measured mean optical depths, for 
CHDM (circles), ACDMl (triangles), ACDMh (inverted trian- 
gles), tCDM (squares), OCDM (diamonds), and SCDM (pen- 
tagons). The upper panel shows F for the value of Qi, assumed 
in the model. The lower panel shows F assuming a fixed value of 
Ql,h^ = 0.020 for all the models. The predicted values are close 
to the estimated F from QSO sources alone for an intrinsic QSO 
spectral index of ag = 1.5 (solid curve) and aq = 1.8 (dashed 
curve). 



in the latter case. Also shown are the theoretical predictions 
for F due to QSO sources alone, assuming an intrinsic QSO 
spectral index of ag = 1.5 or 1.8 (Haardt & Madau 1996). 
The required values of F are found to lie within a factor of 

2 of the predicted rates for Qth^ — 0.020, suggesting lit- 
tle, if any, sources additional to QSOs are required, or are 
even permitted without destroying the agreement between 
the models and the measured spectra. We note, however, 
that the rate of evolution of F necessary to match the mea- 
surements of the mean optical depth over the redshift range 
probed by the spectrum of Q1937-1009 (3.1 < z < 3.7) 
was found to be signiflcantly slower than given by Haardt & 
Madau (1996). The difference may be due to the uncertainty 
in the QSO counts at these high redshifts. 

The values we obtain for F for the ACDMh model at z = 

3 is about twice that reported by McDonald et al. (2000b) 
for a very similar model. (Here and below, we use bion to 
re-normalize the reported values of F to Qb^ — 0.020.) The 
difference is likely due primarily to the larger value for fa 
they flnd. Our results are in closer agreement with those of 
Ranch et al. (1997) in the redshift intervals 2.5 < z < 3.5 
and 3.5 < z < 4.5, although our values for F still lie some- 
what higher. By contrast, our value for F for the SCDM 
model in the interval 3.1 < z < 3.7 is about a factor of 2 
below that found by Ranch et al. for SCDM in the range 
3.5 < z < 4.5. The difference may in part stem from an 
overestimate of fa by the SCDM model simulation used by 
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Rauch et al. (that of Croft et al. 1997), as was demon- 
strated by Theuns et al. (1998b). The value for T reported 
by Weinberg et al. (1997) at 2; = 3 is just above the spread 
of values we find for our various models. 

4.3 Wavelet coefficient distributions 

The wavelet coefficient distributions predicted by the mod- 
els show considerable disagreement with the measured dis- 
tributions. The deviations arc greatest on the velocity scale 
of approximately 15-30 kms~^ , with the models generally 
predicting a much greater amount of structure than is found 
in the data. The closest agreement is found for the CHDM 
model on this scale for all the QSO spectra. The CHDM 
model predicts distributions on this scale consistent with the 
data for Q1937-1009 and HS 1946+7658. The level of agree- 
ment, however, is inconsistent among the Hu et al. (1995) 
QSOs on the 16-32 km s^^ scale. While good agreement is 
found for Q0636-I-680 and Q0956-I-122, the CHDM predic- 
tions are strongly rejected by the KS test for Q0014+813 
and Q0302-003. 

To investigate the reasons for the inconsistency in the 
level of agreement on the 16-32 km s~^ scale of a given model 
among the various Hu et al. QSO spectra, we compare 
the measured wavelet distributions of these spectra. A com- 
parison of the distributions of Q0302— 003 and Q0636+680 
shows that they are only marginally consistent on the 16- 
32 km s"^ scale (^ks = 0.08, Pks = 0.005) (while con- 
sistent on the other scales examined). A comparison be- 
tween Q0302— 003 and Q0956-I-122, however, shows the dis- 
tributions to be consistent on the scale of 16 32kms~^ and 
higher, but not on smaller scales. The data show statistically 
significant inhomogcncity. For this reason, we have chosen 
to treat the analyses of the QSOs separately. (We note that 
the flux per pixel distributions of the four Hu et al. QSO 
spectra show no significant differences over the analysis red- 
shifts, and so in principle could have been treated together. 
For uniformity of presentation, we compare the numerical 
results with the results from each spectrum individually as 
well.) 

The level of agreement between the models and the data 
is not much improved on the next smaller scale of approx- 
imately 8-16 km s~^. While the predicted distributions of 
coefficients for Q1937— 1009 tend to scatter either above or 
below the measured, the predicted coefficients are systemat- 
ically too small on this scale for HS 1946-1-7658. The addi- 
tional measured structure in the latter spectrum may reflect 
the large number of metal absorption lines identifled (Kirk- 
man & Tytler 1997), which will give rise to features on this 
scale. 

None of the models reproduce the light fluctuations on 
the pixel scale, as determined by the lowest level wavelet 
coefficients. It may be that the disagreement results from 
limitations of the simulations. For an expansion veloc- 
ity difference across the simulation box at z — 3.5 of 
~ 1000 km s~^ and 256 grid zones along a side, the char- 
acteristic velocity resolution is ~ 4kms~^ , so that it may 
be difficult to reproduce fluctuations in the spectra on these 
scales. 

Although the wavelet analysis provides a quick and pre- 
cise means of comparing the model predictions with the 
data, it has the disadvantage of not directly relating any dis- 



crepancies to physical quantities. Larger (or smaller) mea- 
sured coefficients than predicted at a given velocity scale 
will arise because fewer (or more) absorption systems are 
predicted by the model on this scale, but the wavelet coeffi- 
cients alone caimot determine whether this is a consequence 
of a mismatch of the density fluctuation spectrum, the as- 
sumed cosmological model, the presence of contaminating 
lines (metal systems) , or the thermal properties of the gas. 
To examine these issues, we also perform a Voigt absorption 
line analysis. 

4.4 Absorption line parameter distributions 

The shapes of the predicted and measured H i column den- 
sity distributions tend to accord well, though not perfectly. 
The Auto VP analysis of Q1937— 1009 yields agreement with 
the measured Nm distribution only for the CHDM model, 
and even this is marginal. The analysis of HS 1946+7658 
(after removing identified metal lines) favours instead the 
ACDM models, although marginal agreement is again found 
for CHDM and tCDM. The predicted and measured Nm 
distributions for all the Hu et al. (1995) spectra tend not 
to agree at an acceptable level, though the predictions of 
CHDM are favoured. The agreement with the A'hi distri- 
bution measured in J2233— 606, however, is acceptable for 
all the models. The level of agroomont is somewhat sensi- 
tive to the analysis method: better agreement is found for 
Q1937— 1009 between the predicted and measured A^'hi dis- 
tributions using SPECFIT than Auto VP, though stfll at a 
marginal to poor level, except for ACDMl for which the 
agreement is good. Of greater concern is the number of 
absorption systems found: both the AutoVP and SPECFIT 
analyses predict numbers of features inconsistent with the 
numbers measured. 

Splitting the absorption line samples into subsamples 
optically thin and thick at the Lya line-centre reveals that 
the discrepancies originate primarily from the optically thin 
systems. Acceptable agreement is found for the optically 
thick systems between the predicted and measured iVni dis- 
tributions for Q1937— 1009 for all the models using AutoVP. 
(The agreement tends to worsen for SPECFIT, however, 
again suggesting that caution should be exercised in judg- 
ing the viability of a model on the basis of the TVhi dis- 
tribution alone.) The predicted and measured distributions 
for HS 1946+7658 for the optically thick systems coincide 
extremely closely for all the models examined, and accept- 
able agreement is again obtained for J2233— 606 (though at 
a more marginal level). There is generally acceptable agree- 
ment between the predicted and measured numbers of opti- 
cally thick absorbers as well. 

The measured and predicted A'hi distributions for the 
optically thin systems match much more poorly. While ex- 
cellent agreement is found between the predictions of the 
CHDM and tCDM models and the measured distribution in 
Q1937— 1009 using AutoVP, all the other models are strongly 
rejected. (The SPECFIT analysis results in no acceptable 
models, though tCDM is marginal.) Poorer agreement is 
found for HS 1946+7658 among the optically thin systems 
than the optically thick as well, although the ACDM mod- 
els are still acceptable. For J2233— 606, the agreement with 
CHDM and tCDM is excellent, while the ACDM models are 
marginal. 
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We note that our results are somewhat at variance with 
those of Gnedin (1998), who uses a pseudo-hydrodynamical 
numerical technique to test a variety of cosmological mod- 
els beised on the predicted H i column density distributions. 
Gnedin finds acceptable agreement between the predictions 
of several models for the shape of the A^hi distribution with 
the tabulated distribution of Hu et al. (1995). By contrast, 
we find none of our models recovers the Hu et al. distri- 
bution, including our ACDM models which are very simi- 
lar to models Gnedin finds acceptable. Based on the results 
of several models, Gnedin places 2a and 3cr constraints on 
the size of density fluctuations for wavenumbers twice the 
Jeans wavcnumbcr. Using his conversion formula to 0-34, 
the density fluctuations gaussian smoothed on the scale 
A;34 = 34f2j^^^ft.Mpc~^, where JIm is the total matter den- 
sity parameter, and the "effective equation of state" param- 
eters reported by Ricotti et al. (2000), these limits become 
(z = 2.85) 1.4 < a34 < 2.6 (2cr) and 1.3 < (T34 < 3.1 (3cr). 
These rule out our CHDM model ((734 = 1.4) at the 2a level 
and our tCDM model (1734 = 1.1) at the Za level, but not 
the others. Yet we find that the shapes of the A^hi distri- 
butions predicted by CHDM and tCDM for Q1937-1009 
(restricted to the optically thick systems, corresponding ap- 
proximately to the column density range used by Gnedin in 
the same redshift range), are consistent with the measured 
(optically thick) distribution, with the CHDM model pro- 
viding the best match of all the models. These two models 
also fare no worse than the others in comparison with the 
Hu et al. (1995) data, and often provide the best (though 
usually unacceptable) match to the measured Nya distribu- 
tion. Whether the disagreement between our findings and 
Gnedin's is due to differences in the absorption line analy- 
sis approaches or the simulation methods (Meiksin & White 
2001) is unclear. 

A much larger discrepancy is found between the mea- 
sured Doppler parameter distributions and the distributions 
predicted by the simulations. The median measured Doppler 
parameters generally exceed the predicted by as much as 30- 
60 per cent. The best agreement is consistently found for the 
predictions of the CHDM model, although this model pro- 
vides a poor match to the measured fiux per pixel distribu- 
tions. The next best match to the measured b distributions is 
provided by tCDM, which also predicts flux per pixel distri- 
butions in disagreement with the data. On the other hand, 
the models that match the measured pixel flux distributions 
predict Doppler parameters that are much too small. There 
appears no simple remedy to this conflict. 

Splitting the sample into optically thick and optically 
thin absorbers at the Lya line-centre shows that the dis- 
crepancy in the Doppler parameters is sensitive to the line- 
centre optical depth tq. The discrepancy in the measured 
and predicted Doppler parameters has been recognized for 
optically thick systems for ACDM, OCDM, and SCDM 
models (Theuns et al. 1998b; Bryan et al. 1999; Theuns 
et al. 1999; Bryan & Machacek 2000; MOO). The results of 
MOO, however, suggest that the discrepancy may be model- 
dependent, as the tCDM model was found there to provide 
reasonably good agreement. Here wo find that the tCDM 
model provides an excellent match to the measured h dis- 
tributions for the optically thick absorbers in Q1937— 1009 
and HS 1946-1-7658, while the 6 distributions predicted (for 



To > 1) by the CHDM and ACDMh models are consistent 
with that of HS 1946-1-7658, though not of Q1937-1009. 

None of the models matches the distributions measured 
in the Hu et al. (1995) QSO spectra, although the CHDM 
model comes close for Q0636+680. This latter point is of 
particular concern, since previous model comparisons have 
been based on the Hu et al. QSO sample, supplemented 
by several other QSO spectra (Kim ct al. 1997), but with 
Q0636+680 excluded. The reason for excluding it was the 
relatively large number of absorption features found, as well 
as the large number of metal systems identified in the spec- 
trum: it appeared that the spectrum may be excessively con- 
taminated by metal linos. Wo perform a comparison between 
the AutoVP results for Q0636+680 and the combined other 
three QSOs of Hu et al. over the common redshift range 
2.7 < z < 3.0 (with identified metal lines removed), and 
confirm a statistically significant difference in the b distri- 
butions using the KS test (^ks = 0.16, Pks = 6 x 10"*). The 
A^Hi distributions, however, agree extremely well, which may 
not have been expected if a large amount of metal contam- 
ination was still present in the Q0636-I-680 line list. (The 
distributions of restframe equivalent widths and line-centre 
optical depths also agree extremely well.) The larger num- 
ber of lines in Q0636-I-680 is expected given the relative 
narrowness of the features (in order to recover the measured 
value of fa). Since all the spectra probe large cosmological 
distances (A2;/(l + z) ^ 0.1), sample variance would ap- 
pear an unlikely explanation for the discrepancy in 6- values. 
The smaller median 6- value measured in Q0636-I-680 is also 
in better agreement with the median 6-value measured in 
HS 1946+7658, at only a slightly smaller redshift. We con- 
clude that there is little justification for preferring the results 
of the other Hu ct al. QSO spectra over those of Q0636+680. 
Adopting the tabulated results of Hu et al. (1995) or Kim 
et al. (1997) as the fiducial b distribution for comparison 
with the simulation predictions may exaggerate the true 
level of disagreement, if any. 

More pronounced disagreement in the Doppler param- 
eters is found for the optically thin absorbers. Only the 
CHDM model is able to produce a distribution consistent 
with any of those measured (only for HS 1945+7658 and 
J2233-606, although the SPECFIT results of the CHDM 
model for Q1937— 1009 are marginal). The discrepancy in 
the numbers of absorption systems is least for the CHDM 
model. This may be attributed to the larger 6-values: fewer 
broader lines are required to produce the same mean opti- 
cal depth fa- A similar explanation accounts for the better 
agreement in the wavelet coefficient distributions compared 
with the other models on the scale of 16 32kms~^ : fewer 
features on this scale are produced by the CHDM model 
than by the other models, resulting in a smaller proportion 
of large coefficients and so a better match to the measured 
distributions. 

Possibly a compromise cosmological model may be 
found which fits all the distributions. Increasing the baryon 
density will increase the temperature of the moderate den- 
sity gas and hence the Doppler parameters (Meiksin 1994; 
Theuns et al. 1998b). For instance, doubling the baryon den- 
sity for ACDMl from Vlbh^ = 0.015 to 0.03 would increase 
the gas temperature by 40 per cent, using the temperature- 
baryon density scaling of Zhang et al. (1998). This may 
increase the median Doppler parameter by as much as 10- 
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20 per cent, depending on the contribution of peculiar ve- 
locity broadening to the line widths (Zhang et al. 1998). 
Although such a high baryon density contradicts the deu- 
terium abundance determinations of O'Meara et al. (2000) 
within the context of standard Big Bang Nucleosynthesis, 
it is consistent with reported CMB measurements of the 
acoustic peaks (Netterfield et al. 2001; Pryke et al. 2001; 
Stompor et al. 2001). 

There is an issue of convergence of the simula- 
tion results, particularly for the b distribution. Theuns 
et al. (1998b) and Bryan et al. (1999) show that the 6- 
values tend to decrease with increasing spatial resolution 
of the simulations, although they were able to explore the 
trend only for a limited range of box sizes. Meiksin & White 
(2001) expanded the box size and resolution ranges by per- 
forming pseudo-hydrodynamical A^-body simulations of the 
Lya forest. They found that convergence was particularly 
difflcult to achieve in the distribution of wavelet coefficients 
on the scale of 16 — 32kms~^ and in the 6 distribution. The 
results of their Particle-Mesh simulations suggest that the 
cumulative distributions may be determined only up to an 
absolute precision of 0.05 — 0.1, and the convergence does 
not always scale monotonically with resolution. The con- 
vergence behaviour, however, was found to improve when a 
pseudo-pressure force was added, and so may be better for 
a full hydrodynamics computation as well. They were still 
unable to reach convergence in the cumulative b distribution 
to better than an absolute precision of about 0.05 even in 
the pseudo-pressure force case, and allowing for as many as 
512^ grid zones to compute the gravity, a greater number 
than used in the simulations here. The question of conver- 
gence of the full hydrodynamics simulations to an absolute 
precision of better than 0.05 in the cumulative b distribu- 
tion we consider as yet unresolved. The general tendency, 
however, is to reduce the Doppler parameters at increased 
resolution, so that the discrepancy between the measured 
and predicted Doppler parameters is unlikely to result from 
a lack of convergence in the simulations. 

Another possibility is that there is missing physics in 
the simulations. Late Ife ii reionization will produce an in- 
crease in the temperature of the gas (Meiksin 1994; Miralda- 
Escude & Rees 1994; Haehneh & Steinmetz 1998; Abel & 
Haehnelt 1999). The possibility of late He ii reionization, at 
a redshift of zhoII = 3 — 3.5, has been suggested in a va- 
riety of contexts: on the basis of intergalactic He ii optical 
depth measurements compared with H i (Madau & Meiksin 
1994; Reimers et al. 1997); an abrupt change in the ion- 
ization parameter of metal lines at these redshifts (Songaila 
& Cowie 1996, but this has been disputed by Boksenberg, 
Sargent, & Ranch 1998); and determinations of an "effec- 
tive equation of state" of the IGM (McDonald et al. 2000a; 
Ricotti et al. 2000; Schaye et al. 2000). Since the opti- 
cally thin absorption originates preferentially in underdense 
regions (Zhang et al. 1998), these systems will not achieve 
thermal balance between photoionization heating and radia- 
tive cooling, and so will more strongly manifest the thermal 
effects of late He ii ionization (Meiksin 1994). Since the as- 
sociated heating will occur while the gas is out of ionization 
equilibrium, the temperature will be boosted by as much as a 
factor of 2 over the equilibrium case (Meiksin 1994). The low 
density optically thin absorbers will thereafter cool largely 
through adiabatic expansion, while the temperature of the 
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Figure 21. The effect of additional heat input due to late 
He II reionization on the predictions of the ACDMl model for 
Q1937— 1009. Shown are the cumulative distributions of the flux 
per pixel (top left panel), wavelet coefficients (top right panel; 
labelled as in Fig. H I column density (bottom left panel) and 
Doppler parameter (bottom right panel) . The effect of late reion- 
ization is mimicked by broadening the simulation optical depths 
by an additional 12kms~^ (dot-dashed lines), corresponding to 
the expected increase in temperature due to the onset of sudden 
He II reionization. The original results without the added broad- 
ening are also shown (dashed curves), as well as the direct mea- 
surements from the spectrum of Q1937— 1009 (solid curves). The 
extra broadening has very little effect on the flux per pixel distri- 
bution, but pronounced effects on the wavelet coeflicicnts and flt 
absorption line parameters. 



higher density gas will relax from its post-photoionization 
level to a lower value as a result of radiative losses. 

We may estimate the change in temperature imme- 
diately following He ii photoionization as follows. Prior to 
He II ionization, the total number density of particles 
is related to the number density of hydrogen nuclei nn by 
rip = 2(1 -I- 5)nH, where ^ = 0.25yHe/(l — i%o) is the ra- 
tio of helium nuclei to hydrogen nuclei, and the baryonic 
mass fraction of helium is Yhc = 0.24 (Izotov & Thuan 
1998). (We have assumed that virtually all the helium is 
in the form of He ii .) For a gas temperature T, the ther- 
mal energy density of the IGM is then = 3(1 + ^)nHkT, 
where k is the Boltzmann constant. After He ii ionization, 
rip = (2 -I- 3^)nH, and the thermal energy density will be 
e'j- — (3/2)(2 + 3^)nukT' , where T' is the gas temperature 
immediately following reionization. For a He ii heating rate 
GhoII per He ii ion and photoionization rate FhcII, the dif- 
ference in thermal energies is TiH^GHcii/rHoii — e'j- — er for 
(nearly) complete reionization. The increase in temperature 
is then 



rr' rji ^ i ^GhcII ^ &oII 

Sk 1 Hcii ik 



(5) 
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assuming 2 AT >> ^T' and ^ << 1. The heating per ion- 
ization efficiency r/Hdi = GHoii/lrHciifHoii), where fncii = 
54.4 eV is the ionization potential of He ii , is sensitive to 
the spectral shape of the ionizing radiation field, which will 
harden within the ionization front, since higher energy pho- 
tons will travel further than lower energy photons before 
being absorbed. This results in a boost in r/Heii, and an in- 
crease in AT. We find for a radiation field intensity cx z/~" 
within the ionization front that 0.35 < ?7hcII < 1-1 for 
2 > a > 0. Assuming a hard field with a ~ gives 77hcII ~ 1 
and AT « 1.7 x 10* K . 

We estimate the effects of an increase in temperature by 
broadening the optical depths in the ACDMl simulation by 
convolving with a gaussian of effective velocity dispersion 
(tt. (It may be argued that since the simulation already 
incorporates He ii heating, a smaller temperature increase 
should be adopted. On the other hand, the value for Q,f,h^ 
is smaller than the estimate of O'Meara et al. (2000), so 
that the gas temperature of the moderate density gas is too 
low, which partially offsets the additional heating. In any 
case, the estimate is only approximate, since it does not ac- 
count for the density dependence of the post-photoionization 
temperature or any dynamical changes that will result from 
the associated sudden change in thermal pressure.) The 
smoothing formally broadens the Doppler widths according 
to h' = {b^ + 2a^y^^ , where b and b' are the Doppler param- 
eters prior to and just after the temperature increase AT, 
respectively. We find that ax = 12kms~^ , corresponding 
to AT = 1.7 X 10'' K , provides adequate broadening. The 
broadening results in a slight increase in fa , requiring an in- 
crease in r to 5 X 10"^"^ s~^ (or 8 x 10~^^ s~^ re-normalizing 
to nth^ = 0.020 using bion) a.t z — 3.5 to match the mea- 
sured value of fa, in good agreement with the estimate of 
Haardt & Madau (1996). We allow for this increase (corre- 
sponding to a reduction in s in Table ^ from 2.56 to 2 .1) 
when re-synthesizing the model spectra. We show in Fig. bll 
the effects of the broadening on the flux per pixel distri- 
bution, wavelet coefficients, and absorption line parameters 
after synthesizing spectra to match that of Q1937— 1009, us- 
ing the same procedure as previously. The KS test compar- 
isons with Q1937— 1009 show excellent agreement with the 
distributions of flux per pixel {(Iks = 0.018, Pks = 0.003 
assuming fully independent pixels, and Pks = 0.23 assum- 
ing a reduction by a factor of 3 in their effective number), 
the wavelet coefficients on the scale of 17-34 km s~^ (^ks = 
0.023, Pks = 0.50) and higher, and the A^hi distribution 
(rfxs = 0.044, Pks ~ 0.28). Poor agreement is found for the 
lower velocity wavelet coefficients and the b distribution be- 
cause of excessive smoothing on small velocity scales. The 
predicted median 6-value (bmed = 27.6kms~^), however, 
now matches the measured. 

We remark that the additional smoothing has only a 
small effect on the pixel flux distribution, bringing it into 
slightly better agreement with the measured. Much larger 
effects are produced on the light fluctuations, as quan- 
tified by the wavelet coefficients and absorption line pa- 
rameters. While previously the predicted Ahi distribution 
for the ACDMl was strongly rejected by the KS test for 
Q1937— 1009, it now provides an excellent match. The pre- 
dicted number of absorption lines (453) now also agrees with 
the measured number (495). The decrease in number is a 
direct consequence of the broadening of the lines, as is the 



good match in the wavelet coefficients and the improved 
match in the Doppler parameters. The shape of the Ahi 
distribution, the number of absorption lines, the wavelet co- 
efficients and the 6-values are all linked, changing together 
in response to the broadening of the absorption features. 
We see, then, that it is because the CHDM model produces 
broader lines than the other models that it (usually) pro- 
vides the best match to these distributions. We caution that 
conclusions drawn on the basis of any one of these distribu- 
tions alone are thus precarious: if one of the distributions is 
poorly matched, whatever fixes it will alter the others. By 
contrast, the flux per pixel distribution is much less sensi- 
tive to changes in the widths of the absorption features, and 
thus provides a much more robust test of the cosmological 
model. 

An alternative explanation for the broader measured 
lines compared with the model predictions may be galac- 
tic feedback. One possibility is Compton heating by Ac- 
tive Galactic Nuclei (Madau & Efstathiou 2000). Ahhough 
Bryan & Machacek (2000) found the heating rate inade- 
quate, this may only be a reflection of the particular param- 
eters adopted for the rate and its evolution. Another possi- 
bility is heating or turbulent broadening by galactic winds. 
The possible role of galactic winds in enriching the IGM with 
metals has been considered by several authors (Wang 1995; 
Nath & Trentham 1997; Ferrara, Pettini & Shchekinov 2000; 
Madau, Ferrara & Rees 2000; Aguirre et al. 2001; Theuns, 
Mo & Schaye 2001). As part of the metal-enrichment pro- 
cess, winds from forming galaxies will inject energy into their 
surroundings. Although the temperature of the remnant 
wind material will decline with time after the winds cease 
(Madau et al. 2000), the galactic winds associated with on- 
going star-formation will provide a continuous source of en- 
ergy to the IGM, broadening the absorption lines either ther- 
mally or through turbulence driven by dynamical instabili- 
ties. Whether or not this is possible without severely disturb- 
ing the agreement with the A'hi distributions is unclear, but 
we may consider the energetics required to alter the Doppler 
parameter distribution. We estimate the energy density of 
the Lya forest as follows. The combined thermal and ki- 
netic energy per particle in gas giving rise to an absorp- 
tion feature with a Doppler parameter b is on the order of 
€p PS (3/4)mH&^, where mn is the mass of a hydrogen atom. 
Half the baryons are contained in Lya absorption systems 
with 13 < logio ^Hi < 15 at 2 < z < 4 (Miralda-Escude 
et al. 1996; Zhang et al. 1998). For an rms Doppler param- 
eter of & « 25 km s~^ , this gives a total (proper) energy den- 
sity in the Lya forest at z = 3 of CLya ~ 5 x 10~^^ erg cm~^. 
An increase in the Doppler parameter by 20 per cent would 
require an increase in the energy density by 40 per cent. We 
compare this with the possible amount of energy injected by 
supernovae. For a (proper) cosmic star formation rate den- 
sity at 2 > 3 of 1 - 10 M© yr"^ Mpc"^ (Madau et al. 1996; 
Steidel et al. 1999; Rhoads et al. 2001) that has been on- 
going for ~ 10® yr, a supernova rate of 1 per 100 Mq of 
stars formed, with a kinetic energy of 10^^ erg per super- 
nova, would be sufficient to produce a kinetic energy density 
input from star formation of 4 x 10^^^ erg cm^'^, allowing for 
an efficiency of 1-10 per cent for the transferal of the kinetic 
energy of the supernovae to the IGM. So it is not implausible 
for feedback from galaxies in the form of supernova-driven 
winds to stir the IGM sufficiently to produce a significant in- 
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crease in the widths of the absorption lines. It is possible the 
presence of the narrow (b < 15kms~^ ) absorption systems 
detected in several of the QSO spectra, of which some if not 
all are due to metals, is evidence for supernovae feedback in 
the form of enriched material. 

4.5 Gunn-Peterson effect 

The original goal of searches for Lya absorption in the spec- 
tra of QSOs was the discovery of intergalactic hydrogen, 
presumed to be uniformly distributed (Gunn & Peterson 
1965). The discovery of a uniform component to the IGM 
has proven largely elusive, although searches continue. These 
are pursued under a variety of strategies: the search for ad- 
ditional Lya absorption to what may be accounted for by 
Lya forest blanketing alone (Steidel & Sargent 1987a), ab- 
sorption in the spectra between Lya forest absorption lines 
(Giallongo, Cristiani, & Trevese 1992), or determinations 
based on the distribution of pixel flux (Webb et al. 1992; 
Fang & Crotts 1995). The measurements provide additional 
constraints on the models presented here. The results of 
these measurements have generally placed an upper limit on 
the optical depth of a smooth component of 0.05-0.1, with a 
few claimed (but model-dependent) detections of this order, 
over the redshift range 2 < 2 < 5. 

The models discussed here show that essentially all the 
structure of the IGM is in the form of fluctuations that 
give rise to absorption features, even in underdense regions 
(Zhang et al. 1998), although a residual smooth component 
may be detectable in minivoids. A small amount of residual 
absorption may be reconciled with BBN constraints on Qt 
and a photoionizing background dominated by QSO sources, 
provided the systems giving rise to the Lya forest have a 
characteristic line-of-sight scale length of Cs/H{z) « Lj, so 
that most of the baryons are contained in the Lya forest 
(Meiksin & Madau 1993). This is explicitly demonstrated 
by the simulations. We may estimate the amount of addi- 
tional absorption due to a smooth component as the differ- 
ence Ts — fa — Tofl, where r^f; is the effective optical depth 
due to line-blanketing, given over a redshift interval Az by 
Toff = (l/Xa) w°^^ /Az, where the sum is carried over the 
observed equivalent widths of the lines and Xa is the (rest) 
wavelength of Lya (Meiksin & Madau 1993; Press, Rybicki 
& Schneider 1993). The mean and effective optical depths 
measured from the spectra in Table ^ (with metal lines re- 
moved as discussed in Section ^ are shown in Figure |22[ The 
differences in optical depth range over 0.01 < Ts < 0.05. The 
predictions of the ACDMl model are also shown. We find 
Ts < 0.1 at 2 ~ 3.5, diminishing nearly to zero at lower red- 
shift. Thus there is very little absorption present in both the 
measured and the model spectra that may not be accounted 
for by the Lya forest alone. Similar results are found for the 
other models. 

We also show in Figure ^ estimates for fa from Press, 
Rybicki & Schneider (1993) and Zhang et al. (1997), who 
based their estimates on the mean absorption measurements 
of Steidel & Sargent (1987b) and Zuo & Lu (1993). These 
measurements have been used in the past for estimating the 
amount of any residual optical depth after removing the ef- 
fect of line-blanketing. The estimates of Press et al. (1993) 
lie systematically high, and those of Zuo & Lu (1993) lie sys- 
tematically low, compared with the values we derive. Our 
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Figure 22. (Upper panel) Comparison between the total mean 
optical depth fa (solid bars) and the effective optical depth 
T(,ff (long dashed bars) due to line-blanketing by the Lya forest 
measured in the QSO spectra in Tabic ^ (The width of a bar 
indicates the redshift interval over which the optical depth is 
averaged.) Also shown are estimates for fa using data from 
Steidel & Sargent (1987b) (dotted line) and Zuo & Lu (1993) 
(dot-dashed line), the estimate of Press et al. (1993) (short 
dashed line), and the estimate of Kim et al. (2001) (dot-long 
dashed line). (Lower panel) The predictions of the ACDMl model 
for fa (solid symbols) and Toff (open symbols) for the spec- 
tra. The points near ^ = 3 have been slightly offset in red- 
shift for clarity. The symbols correspond to Q1937— 1009 (cir- 
cles), HS 1946+7658 (triangles), Q0014-I-813 (inverted triangles), 
Q0302-003 (squares), Q0636-I-680 (diamonds), Q0956-I-122 (pen- 
tagons), J2233— 606 (hexagons). 

values agree closely with those derived from the measure- 
ments of Steidel & Sargent (1987b), and with the recent de- 
termination by Kim, Cristiani & D'Odorico (2001). Adopt- 
ing the values of Press et al. (1993) would in particular 
result in an overestimate of any residual optical depth be- 
yond that due to line-blanketing. 



5 SUMMARY 

We compare measurements of the Lya forest in seven high 
resolution, high signal-to-noise ratio QSO spectra with pre- 
dictions of hydrodynamical simulations for a variety of CDM 
dominated cosmologies. Spectra are synthesized from the 
simulations matching the observed spectra in wavelength 
coverage, pixelization, spectral resolution, and signal-to- 
noise properties. The statistical comparisons between the 
model and measured spectra are based on the distribution 
of flux per pixel and the statistics of the light fluctuations 
in the spectra as quantified by a wavelet analysis and by 
Voigt profile fitting to absorption features. Although all the 
models provide reasonably good qualitative descriptions of 
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the measured Lya forest properties, we find varying levels 
of agreement with the data depending both on the cosmo- 
logical model and the basis of comparison. In general, no 
single model is able to pass all the tests. The most accurate 
agreement is obtained for the flux per pixel distributions, 
with differences between the predicted and measured cumu- 
lative flux distributions as small as cZks < 0.02, close to the 
precision limit of the simulations and the measurements. Al- 
though the KS test formally rejects (at the 3cr level) almost 
all the predicted flux distributions in comparison with the 
measured, an analysis of the probability distribution of dKS 
shows evidence for significant correlations in the flux values 
of neighbouring pixels. Allowing for the indicated reduction 
in the effective number of independent pixels results in ac- 
ceptable fits provided by the ACDM and SCDM models, 
while the remaining models are strongly rejected. We find 
that the slope of the flux per pixel distribution systemati- 
cally decreases with increasing aj, the density fluctuations 
on the scale of the cosmological Jeans length. Agreement 
between the predicted and measured flux per pixel distri- 
butions imposes the restriction 1.3 < aj < 1.7 (better than 
3(t), normalized at « = 3. 

The predicted light fluctuations, as quantified by the 
wavelet analysis and fit absorption line parameters, are gen- 
erally in poor agreement with the data. The most strin- 
gent statistical constraints are set by the distributions of 
wavelet coefficients, because of the large number of pixels 
available. (Correlations between the wavelet coefficients arc 
weak or absent.) An advantage of the wavelet analysis over 
the more traditional Voigt profile analysis is that the orthog- 
onality of the wavelets permits a separation of the velocity 
scales contaminated by metal lines from those dominated 
by Lya absorption features. A further advantage is that 
it is extremely computationally cheap relative to a Voigt 
profile analysis, providing a very inexpensive means of test- 
ing model predictions for the statistics of the light fiuctua- 
tions to high precision. The velocity range of approximately 
15-30 km s~^ is of special interest since it corresponds to 
the velocity broadening of hydrogen absorption lines at the 
characteristic temperature of photoionized gas. Only the 
CHDM model is able to provide an acceptable match to 
the measured coefficients on this scale; all other models arc 
strongly rejected. A confiict is thus presented in relation to 
the flux distribution: models that match well the measured 
flux per pixel distributions provide unacceptable matches to 
the wavelet coefficients, while those that are in agreement 
with the mcEisured wavelet coefficients produce unacceptable 
pixel flux distributions. The agreement at smaller velocity 
scales is mixed, possibly due to contamination by metal ab- 
sorption lines. 

Voigt profile analyses arc also performed, as they relate 
the light fiuctuations to quantities of direct physical inter- 
est, the column densities and velocity widths of the absorp- 
tion features. We find that while good agreement is often 
found between the predicted and measured A^hi distribu- 
tions, including for models that produce satisfactory flux 
per pixel distributions, the agreement is mixed, depending 
on the analyis algorithm and the data. When agreement is 
found, it is often marginal. The agreement generally tends 
to favour the CHDM model, as for the wavelet coefficients. 
The same is true for the total number of absorption features. 
The models other than CHDM usually predict far too many 



absorption lines compared with the number measured in the 
spectra. The model predictions for the Doppler parameters 
fare much worse. The predicted median Doppler parameters 
are as much as 60 per cent smaller than the measured. As 
for the wavelet coefficients, the best agreement is generally 
provided by the CHDM model, although the predicted dis- 
tributions are generally statistically unacceptable. 

We find considerable inhomogeneity in the published 
data, particularly on the 15-30 km s^^ scale. Doppler param- 
eter distributions independently derived from separate spec- 
tra covering nearly the same redshift range show statistically 
significant differences. This may lead to an exaggeration in 
the discrepancy between the Doppler widths found in sim- 
ulations and the measured values appearing in some of the 
literature. 

A breakdown of the absorption lines into subsamples 
optically thin and thick at the Lya line centre shows that the 
discrepancies between the predicted and measured absorp- 
tion line parameters originate primarily from the optically 
thin absorbers. The iVni distributions predicted for the op- 
tically thick systems by most of the models are generally in 
good agreement with the measured distributions, as are the 
numbers of optically thick lines. Much poorer agreement is 
found for the A^hi distributions and line numbers for the op- 
tically thin absorbers. The Doppler parameters show a sim- 
ilar trend. The 6-values of the optically thick absorbers are 
in much better accord with the data than those of the opti- 
cally thin systems. The CHDM model again tends to provide 
the best agreement for both the h distributions and the total 
number of optically thin systems, as it did for the wavelet co- 
efficients on the 16-32 km s~^ scale. This is no surprise, as all 
these distributions are related. The CHDM model produces 
broader features than the other models, so that fewer lines 
are required to reproduce the measured mean optical depth. 
The fewer lines are reflected by proportionally fewer large 
wavelet coefficients (fewer large light fluctuations), agreeing 
more closely with the measured coefficient distributions. 

We consider a few possible explanations for the dis- 
crepant results for the light fiuctuations. The most likely 
is perhaps late reionization of He ii , at zhcII ~ 3.5. Late 
rcionization will increase the temperature of the gas by an 
amount AT « 1.5 — 2 x 10* K . While the temperature of 
overdense gas will relax to lower values as a result of radia- 
tive losses, the lower density gas giving rise to the optically 
thin systems will retain a stronger memory of the boost in 
temperature, because it is too rarefied to radiate efficiently. 
We estimate the impact of the temperature boost for the 
ACDMl model predictions for Q1937-1009 by convolving 
the simulation optical depths with a gaussian of velocity 
width ot ~ 12 kms^^ , corresponding to AT = 1.7 x 10'* K . 
Excellent agreement is produced for the distributions of 
pixel flux, wavelet coefficients (on the 17-34 km s"'^ scale), 
H I column density, and the median Doppler parameter. The 
extra broadening of the features has very little effect on the 
flux distribution, suggesting that the flux per pixel distribu- 
tion provides a robust test of the models that is insensitive 
to uncertainties in the modelling of the reionization history 
of the IGM. We also consider the possible impact of galac- 
tic winds on the IGM and show that they too may plausibly 
provide sufficient broadening of the absorption features, pro- 
vided they are able to propagate throughout the low density 
regions which give rise to the optically thin systems. 
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We caution against the use of any single distribution 
quantifying the light fluctuations, such as the wavelet co- 
efficients, the H I column density, the Doppler parameters, 
or the number of absorption lines, as a test of the models. 
These quantities are all related, so that if a model fails to re- 
produce any one of them, whatever alterations to the model 
are necessary to gain accord with the data will disturb any 
agreement found in the remaining quantities. 

The requirement that the models match the measured 
mean optical depths of the Lya forest places a constraint 
on the H i photoionization rate. The models presented re- 
quire an ionization rate that lies within a factor of 2 of that 
predicted for a UV background dominated by QSO sources. 
A large contribution from additional sources of ionizing ra- 
diation would place the models in jeopardy by requiring an 
implausibly high baryon density. We note, however, that the 
required photoionization background declines at z > 3.1 sig- 
nificantly more slowly than predicted by Haardt & Madau 
(1996) for a background dominated by QSO sources, based 
on current QSO counts. The mean optical depths obtained 
by the models are dominated by line-blanketing, with a 
residual optical depth resulting from a uniform component 
of at most 0.1 at z ~ 3.5, and becoming vanishingly small 
by 2 ~ 2. 
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APPENDIX A: TESTS OF ANALYSIS 
PROCEDURES 

In this section we summarise several tests of the analysis pro- 
cedures: the redshift interpolation method, the noise models, 
the goodness-of-fit of the absorption line analyses, and the 
applicability of the KS test. 



Al Redshift interpolation 

We test the redshift interpolation method using the distri- 
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butions found in the ACDMh simulation. In Figure 
effect of rescaling the flux distributions at 2: = 3 and z — 4 
to redshifts z = 4 and z — 3, respectively, is shown using 
the values for a in Table ^ and equation (No additional 
evolution in the radiation field was assumed; p = 0.) The 
result of rescaling the spectra at z = 3 and z = 4 both to 
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Table 1. Parameters for the cosmological models. f2jvf is 
the total mass density parameter, Ha the cosmological con- 
stant density parameter, Vl^ the baryonic mass fraction, h = 
Ho/100 km s~^ Mpc~^ , where Hq is the Hubble constant at 
z = 0, n the slope of the primordial density perturbation power 
spectrum, crg^_i the density fluctuation normalization in a sphere 
of radius Sh~^ Mpc, and ctj the density fluctuation at 2 = 3 on the 
scale of the cosmological Jeans length (see text). For the CHDM 
model, two massive neutrino species are assumed accounting for 
a mass fraction of = 0.2. 
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Table 2. The QSO sample. The indicated redshift range is the 
portion of the spectrum adopted for the analysis. The pixel reso- 
lution (as A/AA or pixel width) and spectral resolution (FWHM 
in kms~^ ), correspond to this region. The observations were re- 
ported in: Q1937-1009: Buries & Tytler (1997); HS 1946+7658: 
Kirkman & Tytler (1997); Q0014+813, Q0302-003, Q0636+680, 
Q0956+122: Hu, Kim, Cowie & Songaila (1995); J2233-606: Cris- 
tiani & D'Odorico (2000). 



z = 3.6 and then linearly weighting the resulting flux distri- 
butions to 2: = 3.6 using equation H is shown by the central 
dashed curve. The difference between the direct and inter- 
polated cumulative distributions ai z — 3.6 is at most 0.006, 
and indicates the level of accuracy of the modelling of the 
measured spectra. A similar level of accuracy is found for 
the cumulative distributions of the wavelet coefficients. The 
direct and interpolated distributions of the absorption line 
parameters, however, agree somewhat more poorly. While 
the cumulative distributions generally agree to within a dif- 
ference of 0.01, portions may disagree by as much as 0.03 
for the A^Hi distribution and 0.04 for the b distribution at 
intermediate values, in the sense that the interpolated dis- 
tributions exceed the directly estimated distributions. These 
differences are comparable to the accuracy to which the dis- 
tributions may be measured using a single QSO spectrum 
and represent a limitation of the modelling of the distribu- 
tions. Since it was based on a near maximum amount of 
interpolation to an intermediate redshift between the data 
dump redshifts, we expect that the actual modelling is some- 
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Table 3. Evolution exponents, continuum ofTsets, and rescal- 
ing coefficients for the predicted distributions of flux per pixel 
to match that measured in Q1937— 1009. The value for a corre- 
sponds to the redshift interval 3 < z < 4. 
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U! 


s 


CHDM 
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0.005 
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0.85 


tCDM 
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0.04 


0.59 



Table 4. Evolution exponents, continuum offsets, and rescaling 
coefficients for the predicted distributions of flux per pixel to 
match that measured in HS 1946+7658. The value for a cor- 
responds to the redshift interval 2 < z < 3. 



what better. Nevertheless, in the future a greater number of 
intermediate data dump redshifts would be desirable. 



A2 Absorption line goodness-of-fit 

Because of the necessarily automated nature of the absorp- 
tion line parameter analyses, it is important to maintain 
a check on the acceptability of the fits. This was done by 
tabulating the distributions of for the fit line complexes 
for all of the analyses. Because the number of degrees-of- 
freedom ndof differs for different line complexes, the quan- 
tity tabulated is (x^/wdof — l)/(2/ndof)^'''^, which in the limit 
JT-dof — > oo is distributed like a gaussian with zero mean and 
unit variance (Kendall & Stuart 1969). The resulting cumu- 
lative distributions for the ACDMl model predictions for 
Q1937— 1009 (assuming no conti nuum offset) are shown for 
Auto VP and SPECFIT in Figure A2, which are representa- 
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tive of all the analyses. The fits are generally acceptable, 
although both line analysis methods produce a tail of high 
values. The low median value of produced by AutoVP 
is a consequence of the demand for a reduced x^ of 0.5 for 



Table 5. Continuum offsets and rescaling coefficients for the 
predicted distributions of flux per pixel to match those mea- 
sured in Q0014-I-813, Q0302-003, Q0636+680, Q0956-I-122, and 
J2233-606. 

a fit to be acceptable. For SPECFIT, the requirement was a 
reduced x^ of 1. 



A3 KS test 

The acceptability of a hypothesis using the KS test is de- 
termined by the probability of exceeding the value N^^^dKS, 
where (Iks is the maximum difference (in absolute value) be- 
tween the cumulative distribution of some parameter mea- 
sured independently A*' times, and the predicted distribu- 
tion. The probability distribution for exceeding N^^^dKS in 
the flux per pixel distribution derived from the simulated 
spectra of the ACDMl model (with no continuum offset), 
using as the fiducial the average distribution of flux per 
pixel predicted for Q1937— 1009, is shown in Figure A3. The 
distribution of A'^^'^^dKS derived from the simulated spectra 
is broader than the theoretical KS probability distribution. 
Also shown are the probability distributions of N^^^dKS 
based on the distributions of A^hi and b from the AutoVP 
analysis. In contrast to the case for the flux, these distribu- 
tions are much narrower than the theoretical distribution. 

The reasons for the discrepancies in the distributions 
are unclear. Since absorption features typically run over sev- 
eral pixels, individual pixel fluxes will not be statistically 
independent, as is required for the validity of the KS test. 
Similarly, since the absorption line analysis procedure fits 
nearby lines simultaneously, the values found for the fit pa- 
rameters are not truly independent. 
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Figure Al. Test of the interpolation method for the flux distri- 
bution, for the ACDMjj model. The left pair of curves show the 
flux distribution at z = 4 (solid curve) and the predicted distri- 
bution based on rescaling the model spectra at 2 = 3 according to 
equation |l| (dashed curve). The right pair of curves show the flux 
distribution at 2 = 3 (solid curve) and the predicted distribution 
from rescaling the spectra at z = 4 (dashed curve). The central 
pair show the flux distribution at z = 3.6 (solid curve) and the 
linearly interpolated distribution based on rescaling the spectra 
at z = 3 and z = 4 and using equation ^ (dashed curve). The 
predicted and actual distributions at z = 3.6 agree to within 0.4 
per cent. 

We examine the possible effect of correlations in flux 
and in the derived absorption parameter values using a set 
of Monte Carlo simulations of spectra. We model the statis- 
tical distribution of the line positions, H i column densities, 
and Doppler parameters based on the distributions reported 
in Kirkman & Tytler (1997) (see Appendix ^ below). The 
resulting probability distribution of A^'^'^^dKS for the flux, 
shown in Figure is again found to be broader than the 
KS distribution, and even broader than the distribution de- 
rived from the simulation. This suggests that strong corre- 
lations in the pixel fluxes are present. Allowing for a reduc- 
tion by a factor of 3 in the effective number of independent 
flux measurements shifts the theoretical KS probability dis- 
tribution into good agreement with that derived from the 
simulation, while a reduction by a factor of 5 produces good 
agreement between the theoretical probability distribution 
and that measured from the Monte Carlo realizations. The 
resulting probability distributions of N^^^dKS based on the 
derived values of A^hi and b using AutoVP are found to be 
narrower than the KS distribution, although not by as much 
as the simulation results. Once again, correlations appear to 
be present, but tending to reduce the scatter in the derived 
values of the absorption line parameters. 

An element of statistical dependence in the simulation 
results may also arise because the lines-of-sight sampled in 
the simulation are not truly statistically independent since 



Figure A2. Distribution of fo'' the absorption line fits us- 
ing AutoVP (solid curve) and SPECFIT (dashed curve), for 
ACDMl- Shown are the cumulative distributions of (x^/'^dof ~ 
l)/(2/ndof)l/2, where njof is the number of degrees-of-freedom 
of the fit. This quantity should be distributed like a gaussian with 
vanishing mean and unit variance in the limit njof — > 00 (dotted 
curve). Although most of the fits are acceptable, both methods 
produce too great a tail at large values. 

they are drawn from the same simulation, and so may give 
a biased estimate of the correct distribution function with 
a reduced dispersion. This may only be tested using lines- 
of-sight that are spaced adequately to probe separate struc- 
tures, or by a large number of repeated simulations. Neither 
is currently computationally feasible using full hydrodynam- 
ical simulations, so that this possibility cannot be tested at 
present. 

Still another possibility is that the properties of nearby 
features are physically related. Such correlations are ex- 
pected for neighbouring lines-of-sight as a consequence of 
the linear continuity of the gas density deflning the fila- 
ments. Similar correlations may exist among density fluc- 
tuations across the filaments. In this paper, we quote formal 
KS probabilities, noting that these appear to underestimate 
the true probabilities for the flux distribution comparisons 
and to be conservative estimates of the probabilities for the 
comparisons of the absorption line parameter values. Possi- 
bly more realistic estimates of the flux per pixel distribution 
KS-test probabilities are provided by reducing the effective 
number of pixels by a factor of 3-5, and so these values are 
also quoted. 



APPENDIX B: COMPARISON OF AutoVP AND 
SPECFIT 

In this Appendix, the line finding and fitting algorithms used 
in this paper, AutoVP and SPECFIT, are compared using 
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Figure A3. Test of the distribution of probability of exceeding 
a given value of A'^^/^i^kSi where c^ks the maximum differ- 
ence between the cumulative distribution of A'^ values of flux (top 
panel), TVui (middle panel), and b (bottom panel), and the ex- 
pected distributions, using the predictions of the ACDMl model 
for Q1937— 1009. The probability distribution derived from the 
simulation (solid histograms) is broader for the flux, and nar- 
rower for the absorption line parameters, than the respective KS 
probability distributions (short dashed curves). The distributions 
for a set of Monte Carlo realisations of the hya forest are also 
shown (long dashed curves). Also shown in the top panel are the 
KS probability curves assuming a reduction in the effective num- 
ber of independent pixels by a factor of 3 (dotted curve) and 5 
(dash-dotted curve). 



Monte Carlo realisations of spectra. The algorithm for Au- 
toVP is described by Dave et al. (1997). SPECFIT performs 
its analysis through the following steps: 1. filter the spectrum 
using wavelets, discarding the smallest wavelet coefficients 
to ensure a reduced of 1 between the filtered and unfil- 
tered spectrum; 2. identify candidate features as inflection 
points by computing a smoothed second derivative of the 
spectrum; 3. define a spectral region to be fit about each 
candidate line as a contiguous region with the flux smaller 
than a given exp(— Tmin); 4. merge overlapping regions into 
single regions, provided the number of candidates in a single 
region is not too great, in which case it is split (at most 16 
candidate lines were allowed in a single fit region in the anal- 
yses of this paper); 5. perform a non-linear least squares fit 
of the candidate lines to the original (unfiltered) spectrum. 

The spectra are constructed from discrete lines with 
Voigt profiles using the H i column density and Doppler pa- 
rameter distributions found by Kirkman & Tytler (1997). 
Specifically, the H i column densities A^hi are drawn inde- 
pendently from a power law distribution of slope 1.5 be- 
tween 12.5 < logig A'hi < 16 and the Doppler parameters 
b from a gaussian with mean 23 kms~^and standard de- 
viation 14 kms^^ . A cut-off in h is imposed according to 
6 > 14 + 4(logioiVHi - 12.5) kms"^ . The resulting aver- 



Figure Bl. The recovered H I column density and Doppler pa- 
rameter distributions from a set of Monte Carlo realizations. The 
light solid curves show the results of AutoVP while the dashed 
curves show the results of SPECFIT. The heavy solid lines show 
the input model distributions. 



age Doppler parameter is 31 kms~^ . The number density 
of lines per unit redshift matches that of Kirkman & Tytler 
(1997) at z = 3. The resolution is set at \/d\ = 5 x 10'', and 
gaussian noise is added to give a continuum signal-to-noise 
ratio per pixel of 50. 

The two methods are found to have relative advantages 
and disadvantages. AutoVP faithfully recovers the cut-off in 
the H I column density distribution at log^Q A'^hi = 12.5, 
while SPECFIT produces an excess of low column density 
systems. At higher column densities (13 < logjQ TVhi < 15), 
however, SPECFIT is found to better recover the input col- 
umn density distribution. 

By contrast, AutoVP well reproduces the input Doppler 
parameter distribution, although it produces a small tail at 
low values. SPECFIT similarly produces a low b tail, and 
somewhat underpredicts the number of intermediate b sys- 
tems. 

It is evident that neither algorithm perfectly recovers 
the input parameter distributions. No improvement in the 
signal-to-noise ratio would yield better results: the discrep- 
ancy arrises from the unavoidable blending of features. In 
practice, provided the same algorithm is applied to the data 
and the model spectra, the resulting fits may still be used 
as a viable statistical description of the light fiuctuations in 
the spectra. 
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